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LINGUISTIC DETERMINISM AND THE PART OF SPEECH! 


ROGER W. BROWN 
Harvard Universily 


(12), Lee (9), and Hoijer (6) have put for- 

4 ward the view that language is a deter- 
minant of perception and thought. The nature 
of the determining influence exerted by the 
vocabulary of a language is quite clear (1), but 
It is less easy to see how the grammatical fea- 
tures of a language can affect cognition. Yet it 
is just the grammatical differences between 
languages that are most striking and it is their 
determining force that the anthropologist has 
Stressed. This paper undertakes to show how 
one kind of grammatical practice, the alloca- 
tion of words to one or another part of speech, 
does affect cognition. 

The words of a language can be collected 
into classes of formal equivalents which are 
called the parts of speech. Fries (5) has shown 
that English nouns, for example, are words 
acceptable in sentence frames of the type 
(The) is (are) good.” Native speakers 
of English will find it possible to insert “con- 
cert” or “cow” or “truth” in that frame but 
will find “very” or “of” grammatically im- 
possible. Fries has used other sentence frames 
to separate out verbs, adjectives, and adverbs. 
In French, nouns can be further subdivided 
into formal gender classes. In Navaho there 
are more than 20 formal classes for words 
naming different kinds of objects. All of these 
word classes are defined by linguistic science 
in terms of the combinational possibilities of 
forms in a language without reference to the 
meanings of forms. 

So long as these classes are defined in purely 
formal terms they do not suggest important 
cognitive differences. That suggestion comes 
in when we add the semantic correlates of the 
classes, The native speaker of English is likely 
to think of the parts of speech in semantic 
terms. Nouns name substances; verbs name 
Processes; and adjectives name qualities. The 
genders of certain European languages are 
usually called masculine, feminine, and neuter, 
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1 The gist of this paper was presented in a conference 
on “Linguistic meaning” sponsored by the Social 
Science Research Council and held at Yale University, 
May 17-18, 1956. The Laboratory of Social Relations, 

arvard University, financed the research. 


and these are semantic characterizations. The 
object classes of Navaho are usually described 
as words naming round objects, words naming 
long, thin objects, words naming granular sub- 
stances, etc. The linguistic determinists in 
anthropology believe that the semantic char- 
acter of the form classes fixes the fundamental 
conception of reality in a language community 
and that differences on this level correspond to 
different Weltanschauungen. 

At the same time the science of descriptive 
linguistics refuses to define its word classes in 
semantic terms. Fries (5) has shown that for 
the English parts of speech such definitions are 
always either unclear or overextended. We all 
know the English teacher’s characterization of 
the noun as the name of a person, place, or 
thing. The terms “person” and “place” are 
reasonably clear, but do not apply to such 
nouns as “truth,” “odor,” and “thought.” The 
meaning of “thing” is so unclear that we can- 
not tell whether or not odor and thought are 
things. No one has been able to provide clear 
semantic definitions that will serve to distin- 
guish every English noun from every verb, 
adjective, and adverb. It is well known that 
the “masculine” and “feminine” genders in the 
European languages include names for objects 
having no sex. In Navaho, too, the object 
classes do not show perfect semantic consist- 
ency. In short, the semantic definitions of the 
form classes ignore many exceptions and are 
unsuitable for the purposes of linguistic science. 

When the linguistic scientist sets up his 
descriptive categories he quite naturally looks 
for attributes of exceptionless validity, and 
there are not such semantic attributes for the 
English parts of speech. However, the layman 
may operate, in this area as in so many others, 
with conceptions that take account of prob- 
abilistic as well as certain associations. It may 
be that nouns fend fo have a different semantic 
from verbs, and that the native speaker de- 
tects this tendency while he is in the process of 
learning the language. To answer these ques- 
tions examination was made of the nouns and 
verbs of young children learning English to see 
whether there was a semantic distinction be- 
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tween the two parts of speech. The distinction 
proved to be much clearer than it is in the 
vocabulary of English-speaking adults. The 
second step was an experiment to find out 
whether the children were aware of the seman- 
tic distinction between nouns and verbs and 
whether they made any use of the distinction. 


Tue NOUN AND VERB IN CHILDREN’S SPEECH 


Harvard pre-school sessions were visited for 
about a month. There were eight children in 
each class; two of the classes were limited to 
children between four and five years while a 
third class accepted those between three and 
four. As an observer, the author sat on the 
side-lines and let the pre-school life swirl about 
him, recording verbatim all the conversation 
he could hear. From these records, he made 
vocabulary lists classified into the parts of 
speech. It was his impression, on examining 
this vocabulary, that the nouns and verbs of 
children were more nearly consistent with the 
classroom semantic definitions than are the 
nouns and verbs of adults. Nouns commonly 
heard were “truck,” “blocks,” and “teacher.” 
There were no uses of “thought” or “virtue” 
or “attitude.” These observations suggested 
that as the form classes grow larger they de- 
cline in semantic consistency. Perhaps children 
develop firm, and temporarily reliable, notions 
about the semantics of nouns and verbs. These 
notions may stay with them as adults even 
though they retain only a probabilistic truth. 

To compare the character of adult and child 
vocabularies, the first thousand most frequent 
words from the Thorndike-Lorge (11) list of 
adult usage were examined, and also the first 
thousand most frequent words from the Rins- 
land (10) list of the vocabulary of children in 
the first grade. The Rinsland list is based on 
4,630 pages of conversation, plus more than a 
thousand letters and stories. The Rinsland list 
is much the same as lists compiled independ- 
ently by the Child Study Committee of the 
Kindergarten Union (8) and by Horn (7), 

The first set of contrasts deals with two re- 
duced lists; nouns found among the first thou- 
sand for adults but not for children, compared 
with nouns among the first thousand for 
children but not for adults. The set of nouns 
having clearest “thing” character would seem 
to be those that are called “concrete” and it is 
a commonplace to describe the language of 


children as more concrete than that of adults. 
One sense of the pair “concrete-abstract” is 
the same as “subordinate-superordinate.” The 
more abstract term, the superordinate, in- 
cludes in its denotation the denotation of the 
concrete or subordinate term, but extends be- 
yond it. Superordinate-subordinate relations 
between the two lists were all in one direction. 
The adult list included “action,” “article,” 
“body,” “experience,” and at least seven 
others which were superordinate to many 
words on the children’s list. There were no 
nouns on the children’s list superordinate to 
those on the adult list. 

The concrete noun with the smaller denota- 
tion is likely to be more picturable than its 
superordinate, and picturability is another 
common sense of “concrete.” Of course the 
concrete noun, like the abstract, names a cate- 
gory rather than a particular instance. How- 
ever, some categories have a more or less 
characteristic visual contour and size while 
others do not. Visual contour is a defining at- 
tribute for “table,” but not for “thing” OY 
“experience.” Of the adult nouns, 16% named 
categories having a characteristic visual con- 
tour, while 67% of the children’s nouns were 
of this kind. Nouns like “apple,” “barn,” and 
“airplane” name categories for which size is 4 
defining attribute, while nouns like “affair,” 
“amount,” and “action” do not. On the adult 
list, 39% of the nouns were of the former kind, 
while 83% of the children’s nouns had size 
implications. It appears that children’s nouns 
are more likely to name concrete things (in the 
sense of naming narrow categories with charac- 
teristic visual contour and size) than are the 
nouns of adults. 

Two lists of verbs were compared: those 
among the first 1,000 for adults but not for 
children, and those among the first 1,000 for 
children but not for adults. The question her? 
was the percentage of verbs naming anima 
(including human) movement. Of the adult 
verbs, 33% were of this kind, while 67% of the 
children’s verbs named actions. The commo? 
notion that verbs name actions seems tO 
truer for the vocabulary of children than for 
the vocabulary of adults. 

These studies of word lists confirm the im- 
pression that the nouns and verbs used by 
children have more consistent semantic impli- 
cations than those used by adults. It remains 
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a question whether children are, in any sense, 
aware of these implications. There are many 
ways in which such awareness could be useful 
to one learning the language. Adults often try 
to convey the sense of a word by speaking it in 
the presence of the object or event named. All 
such single namings are ambiguous. The adult 
who says “water” while looking at a glass of 
water may cause a child to attend to the glass 
itself as a container, to the glass as a trans- 
parent material, to the liquid character of its 
contents, to the height of the liquid, to the 
state of containment, and so on. Selection of 
the nonlinguistic attributes that govern proper 
denotative use of the word “water” cannot be 
guaranteed by a single naming. Repeated 
pointings can, of course, establish the invariant 
circumstances governing use of the word. If 
there were nothing to suggest to the child the 
probably relevant features of the nonlinguistic 
world, discovery of linguistic meanings would 
be a very laborious affair. However, a new 
word is ordinarily introduced in a way that 
makes its part-of-speech membership clear: 
“Look at the dog” or “See him running.” If a 
part of speech has reliable semantic implica- 
tions it could call attention to the kind of at- 
tribute likely to belong to the meaning of the 
word. A child who had absorbed the semantics 
of the noun and verb would know, the first 
time he heard the word “dog,” that it was 
likely to refer to an object having character- 
istic size and shape, whereas “running” would 
be likely to name some animal motion. The 
part-of-speech membership of the new word 
could operate as a filter selecting for attention 
probably relevant features of the nonlinguistic 
world. It seemed that one could learn whether 
children experience any such filtering of attri- 
butes by introducing to them newly invented 
words assigned to one or another part of 
speech, and then inquiring about the meanings 
the words appeared to have. 

In the children’s speech that had been re- 
corded, nouns and verbs were given proper 
grammatical treatment. In addition, the chil- 
dren made correct use of a subclass of nouns— 
the mass nouns. These are words like “dirt,” 
“snow,” “milk,” and “rice” which are given 
different grammatical treatment from such 
Particular nouns as “barn,” “house,” and 

dog.” For example, when “some” is used 
with “barn” the noun is in the plural—‘‘some 


barns,” whereas a mass noun would be in the 
singular—‘some rice.” The semantic difference 
between these two classes of noun is suggested 
by the designations “mass” and “particular.” 
Mass nouns usually name extended substances 
having no characteristic size or shape, while 
particular nouns name objects having size and 
shape. Many nouns can function in either a 
mass or particular way with attendant shifts 
in the speaker’s view of the referent. “Some 
cake” is a chunk of a mass while “some cakes” 
are either cupcakes or layer cakes arranged in 
a row. Many words in the vocabulary of psy- 
chology have this double potentiality. Al- 
though the personologist deplores such usage, 
the layman speaks of someone having “a lot of 
personality” or “very little temperament.” The 
professional insists that personality is not an 
undifferentiated substance of which one can 
have more or less. Personalities are like cup- 
cakes—all of a size and one to a customer— 
with only their frostings to make them unique. 

In the speech of the pre-school children 
“milk,” and “orange juice,” and “dirt” were 
the most common mass nouns. These were al- 
ways given correct grammatical treatment. No 
one said “a milk” or “some dirts.” It was de- 
cided to work with three functional classes: the 
particular noun, the mass noun, and the verb. 


METHOD 


The experiment involved three sets of four pictures 
each.? One of these sets will be described in detail. The 
first picture in the set shows a pair of hands performing 
a kneading sort of motion, with a mass of red confetti- 
like material which is piled into and overflowing a 
blue-and-white striped container that is round and low 
in shape. The important features of the picture are the 
kneading action, the red mass, and the blue-and-white 
round container. The motion would ordinarily be 
named with a verb (like “kneading”), the mass with a 
mass noun (like “confetti”), and the container with a 
particular noun (like “container”). It was assumed that 
children would have no readily available names for any 
of these conceptions. Each of the remaining three 
pictures of this set exactly reproduced one of the three 
salient features of the first picture, either the motion, 
the mass, or the container. In order to represent the 
motion a second time it was necessary to show also a 
mass and a container. However, the mass was here 
painted yellow so as not to duplicate the original, and 
the container was of a different size, shape, and color 
from the original. The other two sets of pictures in- 
volved different content, but always an action, a mass 


2The author is grateful to Dr. Susan Ervin for 
painting the pictures and for suggesting colors and 
forms that would please children. 
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substance, and a particular object. In one case, the 
first picture showed hands cutting a mass of cloth with 
a strange tool. In the third set, hands were shown 
emptying an odd container of a slushy sort of material. 

In overview, the following use was to be made of the 
three sets of pictures. Children were to be shown the 
first picture in conjunction with a new word identifiable 
either as a verb, a mass noun, or a particular noun. 
Then they would be shown the remaining three pictures 
of the set and asked to point out the one that pictured 
again what had been named in the first picture. It was 
anticipated that when the new word was a verb they 
would point to the picture of motion, when it was a 
particular noun they would point to the container, 
and when it was a mass noun they would point to the 
extended substance. 

Three word stems were used: “niss,” “sib,” and 
“Jatt.” If the stem was to function as a verb X would 
begin by asking: “Do you know what it means to sib?” 
(Children do not always answer “no” as they ought.) 
“In this picture” (first picture of a set) “you can see 
sibbing. Now show me another picture of sibbing” 
(presenting the other three pictures of the set). If the 
stem was to function as a particular noun, X began: 
“Do you know what a sib is?” and proceeded in con- 
sistent fashion. If the word was to function as a mass 
noun, X began: “Have you ever seen any sib?” and 
went on accordingly. 

Each child saw all three sets of pictures and heard 
each of the word stems; one of them as a particular 
noun, one as a mass noun, and one as a verb. The 
combinations of word stem, part-of-speech member- 
ship, picture set, and order of presentation were all 
randomly varied. There were 16 children in all, half of 
them between three and four years, and half between 
four and five. They were all acquainted with X by the 
time the experiment was performed. The procedure was 
very like the familiar business of looking at a picture 
book and naming the things seen and was accepted by 
the children as a kind of game. The game was always 
played with one child at a time. 


RESULTS AND DISCUSSION 


When a new word was introduced as a verb, 
10 of the 16 children picked out the picture of 
movement. When the word was a particular 
noun, 11 of 16 selected the picture of an ob- 
ject; and when the word was a mass noun, 12 
of 16 selected the extended substance. Of the 
15 responses that were not correct, four were 


TABLE 1 


PICTURE SELECTIONS FOR Worps BELONGING TO 
Various Parts or SPEECH 


Category Particular 


M 
Depicted verbs Nouns Mune 
Actions 10 1 0 
Objects 4 11 3 
Substances 1 2 12 
1 2 1 


No response 


simply failures to answer because of some dis- 
traction from the task. The results are sum- 
marized in Table 1. A simple test was made to 
determine the significance of the differences in 
the pictures selected by children when the new 
word was a verb, when the word was a par- 
ticular noun, and when it was a mass noun. 
For example, the selections made when the 
word was a verb were dichotomized into pic- 
tures of actions and all others. These frequen- 
cies for verbs were compared with like fre- 
quencies for the choices made when the words 
were either particular or mass nouns. In other 
words, the test was to determine whether ac- 
tion pictures were more likely to be selected as 
referents for new words introduced as verbs 
than for new words introduced as nouns. 
Comparable tests were made to see whether 
particular nouns were associated with pictures 
of objects and mass nouns with substances. 
All three of the resultant 2 X 2 tables yielded 
differences beyond the .005 level of significance 
when the Fisher-Yates test was applied. 

It is well known that children will sometimes 
do what an adult wishes in a task of this kind 
though they do not understand the task as the 
adult does. Consequently, the qualitative re- 
sults may be more persuasive than the quanti- 
tative. In the first trial with the first child, for 
instance, X showed the picture of cloth being 
cut by an odd tool and said that there was a 
“sib” in the picture. Then went on with: “Can 
you show me another sib?” and while X still 
fumbled with the other three pictures, his sub- 
ject swung around and pointed to the steam 
valve on the end of the radiator saying, 
“There’s a sib.” The pictured tool looked very 
like the steam valve. In another case, X showed 
the picture of confetti-kneading and said, 
“There is some latt in this picture,” where- 
upon his subject said: “The latt is spilling.” 
And it was. 

Recent experiments with phonetic symbol- 
ism (2) and metaphor (3) indicate that seman- 
tic rules are not always arbitrary. A word can 
suggest its meaning because the sound is an 
echo of the sense or because the word had a 
prior meaning which is related to the new 
meaning. The present study suggests that most 
words have an additional kind of “appropri- 
ateness” stemming from their grammatical 
character. While the part-of-speech member- 
ship of a word does not give away the particu- 
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Tema it does suggest the general type 
ae aning, whether action, object, sub- 
ce, or whatever. In learning a language, 
therefore, it must be useful to discover the 
ao correlates of the various parts of 
oo A ae fe discovery enables the learner 
bake “ee membership of a new 
eerie a nst clue to its meaning. The present 
tor ent with very young children who are 
Thing English indicates that in this lan- 
ne at least, the semantic implications of 
2e verb, mass noun, and particular noun are 
discovered by native speakers. It now seems 
Se ak im Si that speakers of other languages 
also know about the semantics of their 
prammatical categories. Since these are strik- 
ngly different in unrelated languages, the 
Speakers in question may have quite different 
Cognitive categories. It remains to be deter- 
mined how seriously and how generally thought 
'S affected by these semantic distinctions. 


SUMMARY 


Descriptive linguistics defines the parts of 
Speech in strictly formal or syntactical terms. 
Nevertheless, the parts of speech usually have 
distinct semantic characteristics. These char- 
acteristics do not hold for all members of the 
Various parts of speech, however, and so can- 
Not serve to define the parts of speech for the 
Purposes of linguistic science. Human beings 
are generally adept at picking up imperfect 
probabilistic implications, and so it may be the 
case that native speakers detect the semantic 
nature of the parts of speech of their language. 
It was shown that the nouns used by young 
English-speaking children were more reliably 
the names of things and their verbs more re- 
liably the names of actions than is the case for 
the nouns and verbs used by English-speaking 
adults. It was shown experimentally that 
young English-speaking children take the part- 
of-speech membership of a new word as a clue 
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to the meaning of the word. In this way, they 
make use of the semantic distinctiveness of the 
parts of speech. It seems likely that speakers 
of languages other than English will also have 
detected the semantic characters of their parts 
of speech. There is a sense, then, in which this 
grammatical feature of a language affects the 
cognition of those who speak the languages. 
Differences between languages in their parts 
of speech may be diagnostic of differences in 
the cognitive psychologies of those who use 
the languages. 
REFERENCES 
1. Brown, R. W. Language and categories. Appendix 
to J. S. Bruner, J. J. Goodnow, & G. A. Austin, 


A study of thinking. New York: Wiley, 1956. 
Brown, R. W., Brack, A. H., & Horowrrz, A. E. 


2. 
Phonctic symbolism in natural languages. J. 
abnorm. soc. Psychol., 1955, 50, 388-393. 

3. Brown, R. W., LEITER, R. A., & Hitpum, D. C. 


Metaphors from musical criticism. J. abnormt. soc. 

Psychol., 54, 347-352. 

4. Bucxincuam, B. R., & Dorcu, E. W. A combined 
word list. Boston: Ginn, 1936. 

5. Frs, C. C. The structure of English. New York: 
Harcourt Brace, 1952. 

6. Hoyer, H. (Ed.) Language in culture. Chicago: 

Univer. of Chicago Press, 1954. 

. Horn, E. The commonest words in the spoken 
vocabulary of children up to and including six 
years of age. In National Society for the Study 
of Education, Twenty-fourth yearbook. Blooming- 
ton, Illinois: Public School Publishing Company, 
1925. 

8. INTERNATIONAL KINDERGARTEN UNION, CHILD 
Srupy Committee. A study of the vocabulary of 
children before entering the first grade. Washington, 
D. C.: The Int. Kindergarten Union, 1928. 

9. LEE, Dorotuy D. Conceptual implications of an 
Indian language. Phil. of Sci., 1938, 5, 89-102. 

10. Rivstanp. H. D. A basic vocabulary of elementary 
school children. New York: Macmillan, 1945. 

11. THORNDIKE, E. L., & Loree, I. The teacher’s word 
book of 30,000 words. New York: Bur. of Publ., 
Teachers College, Columbia Univer., 1944, 

12. Wnorr, B. L. Language, thought, and reality: 

=, writings. Cambridge: Technology Press, 


Received August 17, 1956. 


A STUDY OF ANXIETY IN THE SOCIOPATHIC PERSONALITY! 


DAVID T. LYKKEN 
University of Minnesota 


HE concept of the psychopathic person- 

ality includes so heterogeneous a group 

of behavior disorders as to be at least 
two steps removed from the level of useful 
psychiatric diagnosis. Sociopathic personality 
is a more recent designation (1) which refers 
to a subgroup of these disorders in which the 
pathognomic characteristics are impulsiveness, 
antisocial tendencies, immorality, and a seem- 
ingly self-destructive failure to modify this 
pattern of behavior in spite of repeated painful 
consequences. This category may be regarded 
as a genus composed of phenotypically similar, 
but etiologically distinct, subtypes such as the 
dissocial and the neurotic sociopaths. 

A third species has been described (3, 12, 
13, 14, 17), which may be called primary so- 
ciopathy, in which neither neurotic motiva- 
tions, hereditary taint, nor dissocial nurture 
seem to be determining factors. Cleckley (3) 
has reported the chief clinical characteristic of 
this group as a lack of the normal affective 
accompaniments of experience. If this observa- 
tion is correct, it would point the way toward 
accurate diagnostic isolation of primary sociop- 
athy as well as guiding research into the 
question of its etiology. Classification accord- 
ing to the presence or absence of defective 
emotional reactivity, therefore, satisfies one 
criterion of useful diagnosis in that it shows 
promise of relationship to the as yet unknown 
origins of the disorders to be distinguished. 

The other requirement for useful diagnosis 
is that the criteria of classification must be 
objective. Clinical assessment of the “nor- 
mality of the affective accompaniments of 
experience” is subjective and unreliable. In 
consequence, Cleckley’s work has had as yet 
little real impact on psychiatric practice, By 
expressing this putative defect of the primary 
sociopath in terms of the anxiety construct of 
experimental psychology (18, 19, 20, 21, 22), 


1 Drawn from a thesis submitted to the University of 

i i i f the requirements for 

innesota in partial fulfillment o ts f 
ie degree of Doctor of Philosophy. The author is in- 
debted to his adviser, Professor Ephraim Rosen, and to 
others whose assistance aided in the completion of this 


research. 


it becomes susceptible to quantification and 
empirical test. 

An experimental hypothesis may now be 
formulated. Among persons conventionally 
diagnosed as psychopathic .personality, those 
who closely resemble the syndrome described 
by Cleckley are (a) clearly defective as com- 
pared to normals in their ability to develop 
(i.e. condition) anxiety, in the sense of an antic- 
ipatory emotional response to warning signals 
previously associated with nociceptive stimu- 
lation. Persons with such a defect would also 
be expected to show (b) abnormally little 
manifest anxiety in life situations normally con- 
ducive to this response, and to be (c) relatively 
incapable of avoidance learning under circum- 
stances where such learning can only be ef- 


fected through the mediation of the anxiety 
response. 


METHOD 
The Sample 


The extreme heterogeneity, even on the crudest 
descriptive level, of persons diagnosed as psychopathic 
personalities in various clinical or institutional settings 
complicated the selection of an appropriate experimen- 
tal sample. The institution psychologists? were given a 
list of 14 criteria drawn from Cleckley (3, pp. 355-392) 
and were asked to compare against these criteria those 
inmates diagnosed as psychopathic personality. In- 
mates who, in their opinion, best fitted the Cleckley 
prototype were listed as candidates for experimental 
Group I, the primary sociopathic group. Inmates who 
they felt did not meet the criteria in important respects 
were listed as candidates for experimental Group I, 
designated as the neurotic sociopathic group. In this 


selection process, the psychologists were asked to re- y 


affirm the original diagnosis, discarding from considera- 
tion for either group those inmates who, in their present 
opinion, would not be diagnosed as psychopathic 
personality at all. 

A control Group III of 15 “normals,” roughly com- 
parable in age, intelligence, and socioeconomic back- 


ground, was selected from the University General Col- 
lege and a local high school. 


Group I, composed of 12 males and 7 females, had a 


* The writer is indebted to the administrators and 
to the psychologists of the Minnesota State Reforma- 
tory, St. Cloud, Minnesota; the Minnesota State Re- 
formatory for Women, Shakopee, Minnesota; the 
State Home for Girls, Sauk Centre, Mi 


, innesota; and the 
St. Peter State Hospital, St. Peter, Minnesota. 
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mean age of 21.6 ycars (SD = 4.3), and a mean IQ of 
109.2 (SD = 10.7). Group II included 13 males and 7 
females, had a mean age of 24.5 years (SD = 5.4), and 
a mean IQ of 104.5 (SD = 8.8). For the 10 male and 5 
female normals, the mean age was 19.07 (SD = 3.2), 
and the mean IQ 100.4 (SD = 10.2). None of these 
group differences were significant. 


The Measures and Testing Procedure 


g It was necessary to do the testing at the several 
Institutions under varying conditions. In all cases, 
however, the apparatus was arranged on a large table, 
the experimenter on one side and the subject (S) seated 
comfortably opposite. The S was told that he was 
assisting in a psychological experiment having no bear- 
ing on his personal record and that his performance 
would be treated with strict anonymity. An attempt 
Was made throughout to keep the testing on an informal 
basis. 

As an indicant of manifest anxiety as referred to in 
hypothesis b, an “Anxiety Scale” was constructed ex- 
pressly for this study to supplement the Taylor scale 
and Anxiety Index which appear to be more strongly 
loaded on a factor of neurotic self-description. In this 
new scale, each of the thirty-three items involves two 
statements of activities or occurrences, matched for 
general unpleasantness or undesirability according to a 
modified Thurstone scaling procedure utilizing 15 
college student judges. One activity of each pair is 
unpleasant, presumably because of its frightening or 
embarrassing character (e.g, “making a parachute 
jump” or “knocking over a glass in a restaurant”). The 
paired activity is intended to be onerous but not fright- 
ening (e.g., “digging a big rubbish pit” or “cleaning up a 
spilled bottle of syrup”). The S is required to choose 
that member of each pair which he would prefer as a 
lesser of evils. The degree to which the “frightening” 
alternatives are rejected is interpreted as an index of 
the extent to which anxiety determines behavior choices 
within the range of life situations sampled by this test. 

The booklet form of the MMPI was used and the 
answer sheets scored and K-corrected in the usual way 
(10). The Anxiety Index, or AI, was calculated accord- 
ing to the formula given by Welsh (23). The Heineman 
form (11) of the Taylor scale was given and scored by 
subtracting the number of “anxiety” items rejected as 
“least applies to me” from the number endorsed as 
“most applies to me.” ; 

An avoidance learning test was given to determine 
whether there were group differences in capacity to 
learn on the basis of anxiety reduction (hypothesis c). 
It involved an elaborate, electrically operated mental 
maze which the S was given 20 trials to learn (the 
“manifest task”). At each of the 20 choice points in this 
maze, choice of one of the 4 possible alternatives (al- 
ways an error alternative) gave an electric shock. It 
was intended that social and ego rewards should rein- 
force performance in the manifest task. Performance on 
the “latent task,” which was to avoid the shocked 
alternatives—to err instead on the unshocked alter- 
natives—was presumably reinforced only through 
anxiety reduction. " 

The measure of anxiety conditionability (hypothesis 
a) employed the GSR as the dependent variable. A 


shocking electrode was attached to S’s nondominant 
hand, the GSR electrodes being already in place on the 
dominant hand. The S was told that after the blindfold 
had been replaced, he was to sit as quietly as possible 
for the next 30 to 40 minutes, during which time he 
would periodically hear a buzzer (which was then 
demonstrated) and occasionally receive a brief electric 
shock. When the S was seated comfortably and relaxed 
insofar as possible, the recording apparatus was started 
and the conditioning series (CS) begun. 

Two buzzers were used which were distinguishably 
different in timbre rather than in pitch, the difference 
being one not easily labeled (to minimize verbal medi- 
ation of a discrimination between them). Buzzer No. 1 
was used as the CS and was the only one reinforced; 
buzzer No. 2 was used to test for generalization effects. 
In all cases, stimuli of the conditioning series were pre- 
sented as soon as GSR activity from preceding stimuli 
had subsided, the intertrial interval being therefore not 
constant within or between Ss, but averaging between 
20 and 60 seconds. (This method of stimulus timing 
automatically eliminates temporal conditioning.) When 
turned on, the buzzers sounded for a period of 5 sec- 
onds, controlled by an automatic timer. 

The reinforcing stimulus or UnCS was an electric 
shock from a 700-volt AC supply through two 68,000- 
ohm series resistors, presented automatically for about 
100 milliseconds just before the termination of the CS 
(buzzer No. 1). The shock was applied between an 
electrode on the palm of one hand and the GSR ground 
electrode on the palmar tip of the middle finger of the 
opposite hand. The shock sensation was felt mainly on 
the richly innervated finger tip and was a decidedly 
unpleasant stimulus, producing in most cases a pro- 
nounced startle reaction and in all cases a strong GSR. 

The sequence of trials or stimulus presentations was 
as follows: 

1. To permit the adaptation of unconditioned GSR 
to the buzzers themselves, stimuli were first present- 
ed without shock reinforcement for a total of 10 trials 
in the order 2, 1, 2, 1, S, 2, 2, 1, 1, 1, 1. A single pre- 
liminary shock was given in the series at the point S, 
separated by at least 30 seconds from the buzzers 
occurring before and after it. 

2. Seven consecutive shock-reinforced presentations 
of the CS were given as the conditioning series, followed 
by four more reinforcements interspersed with four 
unreinforced trials with buzzer No. 2 in the order 1, 1, 
4,14, 15, 24,29, 1, 244 

3. A total of 24 extinction trials was then given, the 
two buzzer stimuli being presented in the order 1, 2, 1, 
D 2, 44:2, 1, 1,2, 0, Us Vp Ly 2s Sede 2da Ly Qh, Lyd 
Considering only the CS, buzzer No. 1, the series there- 
fore consisted of 6 prereinforcement trials, 11 rein- 
forced conditioning trials, and 16 extinction trials. 

Skin resistance was measured by a modification of a 
circuit suggested by Flanders (6) which passed an 
electronically regulated constant DC current of 40 
microamperes through S. The electrodes were curved 
discs of Monel metal, 15 mm. in diameter, applied to the 
palmar surface of the distal phalange of the first, second, 
and third fingers of the same hand. The skin surface 
was first scrubbed with alcohol and then coated with 
Sanborn electrode paste. The exosomatic current was 
applied between the first and third fingers, which were 
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also connected to the push-pull input grids of a Sanborn 
Model 126 DC amplifier, driving a Sanborn Model 127 
recording milliameter. The electrode on the second 
finger was connected to amplifier and external ground. 
The instrument was calibrated before each use and 
provided a linear record of resistance and resistance 
change, accurate to less than +50 ohms. 

‘All GSRs were recorded in terms of resistance 
change. A variety of transformations was then applied 
and tested against the usual criteria of normality of 
distribution, correlation with basal resistance, and 
homogeneity of variance across people with respect to 
several test stimuli (2, 8, 9, 16). The result of this 
analysis was that each resistance change was expressed 
as the logarithm of the ratio of that change to the mean 
resistance change produced by the first six electric 
shocks. This unit expresses the galvanic CR as a pro- 
portion of the individual’s UnCR and, for a condition- 
ing study, seems quite appropriate for individual com- 
parisons. 

Three GSR indices were derived from the protocols 
of the conditioning series: (e) GSR Reactivity, which 
is the mean GSR to the CS during the fourth through 
seventh conditioning trials; (b) GSR Conditioning, 
which is equal to (a) minus the mean GSR to the last 
three preconditioning trials and the last three extinction 
trials (this index measures essentially the slope of the 
conditioning curve or the increment actually produced 
by the reinforced trials); (c) GSR Generalization, the 
ratio of the mean GSR to buzzer No. 2 during early 
extinction trials 18, 20, 21, 23 to the mean GSR to buz- 
zer No. 1 during trials 17, 19, 22, 24. 

The testing sequence was as follows: (a) Anxiety 
scale; (b) GSR Conditioning series; (c) Avoidance 
Learning test; (d) MMPI (given during the week fol- 
lowing the foregoing individual testing); (e) Taylor 
Manifest Anxiety Scale, forced-choice form given later 
with the MMPI. 


RESULTS AND Discussion 


Scores on all measures were converted for 
easier comparison to a standard score form 
with each distribution having a grand mean of 


TABLE 1 


Group MEANS ON ALL MEASURES: 
SIGNIFICANCE TEstTs* 


Group 


Measure | | dTest 
E. | I | IN | prob.t 
\ | 
Taylor Scale | 471 | 556 | 462 | 01 
Anxiety Index | 472 | 557 | 464 | 01 
Anxiety Scale | 470 | 511 | 529 | <05 
MMPI Pa-Scale | 532 | 547 | 395 | (95 
Avoidance Learning 461 501 558 | “04 
GSR Reactivity 498 494 | 534| 05 
GSR Conditioning 478 | 483 | 551 05 
Generalization 473 | 542 | 490| — 


>All measures converted to a scale having an over-all mean of 
500 and SD of 100 a ; 
° + Probabilities given are for significance of largest difference 
( Il — I for GSR Conditioning). Significance test was Fes- 
eg.» = 


tinger’s distribution-free ‘d’ test (5). 


500 and a standard deviation of 100. Group 
means on all measures, together with signifi- 
cance test results, are given in Table 1. 

It would clearly be too much to expect of 
the judgments based upon the Cleckley cri- 
teria that they should have perfectly separated 
the psychopathic sample into a “primary” 
species in Group I, and a neurotic or dissocial 
species in Group II. That the separation was 
reasonably good, however, is supported by the 
finding that Group II scored significantly 
higher than the normals on the Taylor scale, 
a great deal of evidence having accumulated 
(4, 7, 15) to indicate that this scale is primarily 
a measure of neurotic maladjustment or neu- 
roticism rather than of anxiety level or anxiety 
reactivity per se. On the MMPI Anxiety Index; 
which like the Taylor scale is unquestionably 
polydimensional with a heavy loading ©? 
neuroticism, Group II again has the highest 
mean, with Group I again only slightly highet 
than Group III. 

In contrast, the Anxiety scale, which was 
designed for this study and which is not loade 
on neuroticism and only negligibly correlate 
with the Taylor scale or the AI, separated the 
groups in a different order. On this test, the 
primary types of Group I show the least an*” 
iety reactivity, significantly less than the 
normals, with Group II falling in between bu 
rather nearer to the Group III mean. This 
result appears to support hypothesis b of this 
study, that the subset of primary sociopaths 
show abnormally little manifest anxiety, 1:67 
anxiety reactivity to the real-life anxiet 
stimuli referred to in the questionnaire. 

Both sociopathic groups scored significantly 
higher than the normals on the Pd scale of the ‘ 
MMPI, but this measure, which differentiate? 
at the phenotypic or genus level, does ne 
distinguish between the types or species g 
sociopathy represented in Groups I and II. ” 

Schedule difficulties unfortunately led tO 
reduction in the number of Ss to whom ea 
avoidance learning test could be given. Le 
nearly half of the total group, the availab 
testing time was too short to cover all of the 
procedures; in such cases the avoidance test, 
requiring nearly an hour to give, was passe 
over. Even on the residual sample of 34 SS 
however, rather clear-cut differences exist- AS 
a crude, overall index of avoidance learning: 
the avoidance scores (shock errors divided 
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unshocked errors) were averaged for all but 
the first of the 20 trials; this is the basis of the 
mean scores entered under “avoidance” in 
Table 1. The distribution was reversed to make 
high values represent greater avoidance of the 
shock. It is impossible, of course, to summarize 
adequately a complex learning process by a 
single numerical index of this sort, but in 
spite of these limitations, it is striking that 
Group I (primaries) shows the least avoidance 
as expected, Group II (neurotics) next, and 
Group III (normals) the most. The Group I 
versus Group III, and Group II versus Group 
III differences are significant by Festinger’s 
d-test (5), and the actual distribution of scores 
shows the groups to be remarkably well sepa- 
rated (only 17 per cent overlap between Groups 
I and III). This result supports hypothesis ¢ 
of this study, that the primary sociopath 
demonstrates defective avoidance learning. 


Results of the GSR Conditioning Series 


Of all the tests employed here, principal 
emphasis should be laid on GSR conditioning. 
The various difficulties attending the inter- 
pretation of GSR data are well known, but 
one fact stands out with relative certainty: 
given certain necessary conditions, if an S does 
not produce a GSR to a stimulus, one can be 
sure that he has not “reacted emotionally” to 
that stimulus. 

The two numerical indices which were de- 
rived as alternative ways of representing in a 
single value the conditioning indicated by the 
GSR protocols (anticipatory GSR to the 
buzzer after several pairings with shock) have 
already been described. As shown in Table 1, 
the group means are in the expected order on 
both indicants, with Group I significantly 
lower than Group III on GSR Reactivity and 
GSR Conditioning (.05 level, d-test). 

A somewhat more meaningful comparison 
is obtained by contrasting the reactivity by 
trials for the three groups. Group I shows the 
least GSR reaction to the CS in 14 out of the 
16 double trials. Group II is significantly 
higher (.02 level) than Group I at the end of 
the extinction trials. The positions of Group 
II and Group III interchange during the 
series with Group II beginning to show greater 
reactivity during the extinction trials, sug- 
gesting a perseveration (failure of extinction) 
of the anxiety response in the neurotic group. 


This trend was tested for statistical reliability 
by correlating the differences between Group 
II and Group III with the ordinal position in 
the conditioning series at which the difference 
was taken. The quadrant sign test (24) shows 
this association to be significant at the .01 
level. This result supports hypothesis a of this 
study, that the primary sociopath is defective 
in his ability to condition the anxiety response. 

The generalization scores were leptokurti- 
cally distributed, the group differences being 
determined by a few deviant Ss. Group II 
shows the highest mean generalization score, 
but the differences are not significant. 


SUMMARY 


Forty-nine diagnosed psychopaths were di- 
vided into two groups according to the de- 
scriptive criteria of Cleckley. Fifteen normals 
served as controls. A battery of tests related 
to anxiety reactivity or anxiety conditionabil- 
ity were administered. As compared with nor- 
mals, the Cleckley, or “primary” sociopaths, 
showed significantly less “anxiety” on a 
questionnaire device, less GSR reactivity to a 
“conditioned” stimulus associated with shock, 
and less avoidance of punished responses on a 
test of avoidance learning. The “neurotic” 
sociopaths scored significantly higher on the 
Taylor Anxiety Scale and on the Welsh Anx- 


iety Index. 
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relatively low relationship between the 

Intensity of the galvanic skin response 
(GSR) and the amount of overt emotional ex- 
pression when both were measured simultane- 
ously in nursery-school children. Although 
stimuli which on the average elicited strong au- 
tonomic responses were also found to evoke a 
great deal of observable emotional expression, 
for individual children the correlation over situ- 
ations between GSR activation and outward 
emotionality was quite low.” Since the measures 
employed were highly reliable, the low rela- 
tionship within individuals between internal 
and overt manifestations of emotions suggested 
that stable individual differences exist in the 
patterning of emotional expression. 

This provocative finding was repeated and 
extended by Jones in a further study (10). 
Here, one hundred adolescents were available 
who, in the course of participating in a longi- 
tudinal study, had experienced eleven test 
situations where GSR recordings were gath- 
ered. The 20 individuals who were highest in 
the average magnitude of their GSRs and the 
20 who were lowest were selected, and the in- 
dependently rated overt behavioral character- 
istics of the two groups compared. 

The “high reactives” were rated as less as- 
sertive, less animated, less talkative, and less 
attention-seeking than the “low reactives,” all 
of these differences occurring at the .01 level 


' S= years ago, H. E. Jones (9) found a 


1 This study was supported in part by grants to the 
Institute of Personality Assessment and Research from 
the Rockefeller Foundation and the National Institute 
of Mental Health of the National Institutes of Health, 
U. S. Public Health Service. Assessment is a group 
endeavor, and the writer is pleased to record his debt 
to the staff of the Institute who participated in the 
program and who helped provide the data reported 
here. The writer is particularly grateful to Samuel 
Fillenbaum for his advice and assistance. 

An abbreviated version of this paper was read at the 
meeting of the American Psychological Association at 
San Francisco, California, September, 1955. 

2 The correlation over the 22 situations between the 
8roup’s average GSR and the average rating on overt- 
Ness of response was .83 and .80 in the initial and retest 
experiments. For individual children, the correlatlon 
averaged .20 and .25 in the two separate tests. 
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of significance. The high reactive group was 
also evaluated as more calm, more deliberative, 
more good-natured, more cooperative, and more 
responsible than the low reactives, who were 
judged as irritable, excitable, impulsive, and 
immature. 

Jones suggested that the high reactives are 
introversive characters, while the low reactives 
represent the psychoanalytic concept of “im- 
pulse neurosis” (2). The process of socializa- 
tion is the agent causing the child to shift from 
undifferentiated, externalizing modes of ex- 
pression to complex, internalizing avenues of 
affect discharge. 

It is important to note that Jones also ob- 
served the presence of two additional patterns 
of emotional expression. He identified a “‘gen- 
eralizer” type, consisting of children who re- 
spond at both overt and internal levels simul- 
taneously, and a “reciprocal” type, represented 
by children who sometimes respond overtly but 
with little GSR reactivity and who at other 
times evidence a highly reactive GSR but with 
little overt response. 

The work by Jones has been summarized 
here because it is surprisingly little known and 
because it served as a springboard for the pres- 
ent experiment. His results have implications 
for conceptions of the nature and etiology of 
psychopathy and for a more general under- 
standing of the place and process of affect in 
personality. 

The present study aimed at an independent 
verification and extension of Jones’s findings. 
It contrasts with the previous work in several 
ways: the subject sample used was older and 
probably more homogeneous with respect to in- 
telligence, interests, and socioeconomic level; 
there was only one GSR recording session; the 
more stressful circumstance of lying was em- 
ployed as the stimulus situation; and finally, 
objective test measures as well as ratings were 
available for comparison with GSR reactivity. 


METHOD 


Subjecis and Experimental Procedure 
Seventy male applicants to a medical school under- 
went an 18-hour assessment program at the Institute 
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of Personality Assessment and Research. In e hes 
nged from 19 to 32 years with a mean at ; a 
a le the group was at a very superior me 
Their socioeconomic level was clearly middle are 
Each of the subjects (Ss) was observed in a yi e 
variety of personality revealing situations, — E 
harades and stress interviews to standardize psycho- 
j and dinnertime conversation. One of the 
eae each S experienced was the lie-detection 
caries At his scheduled time, S entered the experi- 
ae 1 tooni where he was greeted by the experi- 
ama (E) and seated before the imposing apparatus. 
rae een was attached around S’s chest, his 
one was inserted into a pulse pressure penne 
device, and electrodes were attached to his right and. 
Only the GSR circuit of the apparatus was in fact 
operative during the experiment, the respiration and 
pulse transducers being employed for their dramatic 
3 
ek R a short interview, inquiring about 
the origins of S’s medical interests, his professional 
aspirations, the nature of his father’s occupation, and 
so on, while an assistant adjusted the apparatus. S was 
then informed that the apparatus was a lie detector 
and that his skill as a liar was to be investigated. 
The S selected a number ranging from 1 to 5 with- 
out informing E of his choice. The E then queried S a 
number of times over each possibility, employing a 
fixed but unsystematic order of interrogation. To each 
question, S was instructed to say “no,” even where 
denial meant a lie. Questioning was varied i 
and accentuation so as to maintain the interpersonal 
aspects of the essentially standard stimulus situation. 
At the end of the series of 30 questions, S informed E 
as to his actual choice. In two additional replications, 
S selected an alternative (one of six months, one of 
six colors) and, again, E repeatedly interrogated him as 
to his choice. Again, S denied all possibilities, including 
the one he had in fact chosen. The temporal relations 
of stimuli and responses were recorded by noting E’s 
questions on S’s GSR record. 
It was found that some Ss displayed a diffuse and 
unselective GSR responsivity and some were relatively 
unreactive; most were selectively responsive, the GSR 
tending to appear only after the lie. Over the three 
trials, an accommodation effect was usually observed as 
the S adapted to the situation 
Two raters judged the GSR records for amount of 
reactivity and selected the 20 most highly reactive 
individuals and the 20 least reactive individuals, 
Reactivity was judged primarily on the basis of the 
number of GSRs the subject manifested without 
further quantification of the size or duration of the 
response. Ratings were employed rather than a simple 
count because of the necessity to evaluate occasional 
movement and instrument artifacts. Classification of 
Ss into the categories was highly reliable, 36 of the 40 
criterion cases being selected by both raters. The 
disagreements were resolved by mutual consent of the 
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3 Although there is some controversy among practi- 
tioners as to the suitability of the GSR as a lie detector 
under actual field conditions, there appears to be 
agreement as to its validity in experimental contexts 

E 


(8, 11). 


raters. Categorization of Ss into the two GSR criterion 
groups occurred before the examination of any other 
data. The results to be reported in the next section are 
based upon the comparison of the two groups, Re- 
actors and Nonreactors, on a number of measures. 


RESULTS 
Intellectual Differences 


Intelligence is a powerful determiner of be- 
havior, and its role in any particular context 
requires evaluation. Accordingly, the Reactors 
and Nonreactors were compared on measures 
defining various aspects of intellectual func- 
tioning. These included the Medical College 
Admissions test, grade-point average, a college 
vocabulary test (5), the Gottschaldt test (3), 
and a measure of originality based upon three 
of the Guilford tests of originality (6). 

On none of these measures did the differences 
between the two groups approach significance. 
There is no evidence that such differences as 
may exist between the two groups can be 
ascribed to intellectual factors as these are or- 
dinarily understood and measured. 


Differences on Rating Measures 


At the end of assessment, each S was evalu- 
ated by each staff psychologist. One of the 
means employed for codifying staff impres- 
sions of each S was the Q-sort method of per- 
sonality description (13); another was the 
Gough Adjective Check List (4). 

A comprehensive 115-item Q set was used 
which included items oriented toward highly 
inferential features of the personality system 
as well as toward the more readily observable 
aspects of the individual.4 Five psychologists, 
none of whom had observed the S’s behavior 
in the lie-detection situation, Q sorted each S. 
These data, combined into a composite by 
equally weighting the ratings of each judge, 
codified the impressions and evaluations of 
these staff members with respect to the subject. 
Because the psychologist raters had no infor- 
mation as to the behavior of the S in the test 
situation, the Q sort data provide a totally 
independent source of information as to the 
personality characteristics of each S. 

The comparison of the Q ratings for Reactors 
with the ratings for Nonreactors revealed 
twelve items differentiating at or beyond the 


“A copy of the Q set em 


é ployed is available from the 
writer. 
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.05 level of significance. These are listed in 
Table 1. 

A separate group of seven staff psychologists 
employed the Adjective Check List in their 
characterization of each S. These data, when 
summed across raters and analyzed, served as 
a cross check on and amplification of the 
Q findings. The adjectives differentially char- 
acterizing the Reactors and Nonreactors at or 
beyond the .05 level of significance are pre- 
sented in Table 2. 

It is clear from the nature of the differenti- 
ating Q items and adjectives that the two 
groups differ along some version of the psy- 
chological dimension suggested by Jones. The 
Reactors are perceived as withdrawing, worry- 
ing individuals who turn their anxieties toward 
internal routes of expression. The Nonreactors 
are evaluated as independent, aggressively di- 
rect, and relatively nonconforming, all of these 
being visible rather than inward or covert ex- 
pressions of impulses. 


Inventory Items Differentialing the Groups 

A number of personality inventory items 
(from the Minnesota Multiphasic Personality 
Inventory and the California Psychological In- 
ventory) distinguished between the two groups. 
Summarizing their import, it would appear 
that the Reactors introspect relatively often, 
are sometimes radically subject to their moods 
and impulses, and are very much concerned 
with the appropriateness of their behavior in 
social contexts. In contrast, the Nonreactors 
indicate an absence of rumination and a self- 
control of their behavior. Also expressed is a 
preference for autonomy and a difficulty with 
representatives of authority. 


Some Test Measures Differentiating the Groups 


All Ss experienced the rod-and-frame pro- 
cedure developed by Witkin (14). Here the S 
is placed in a totally dark room before a lumi- 
nescent square tilted from the vertical. He re- 
volves, via a suitable control, a rod centered 
within the frame until he perceives the rod as 
at the true vertical (or horizontal). In the 
blackness of the experimental room, the only 
external cue to verticality is the misleading 
frame. An alternative basis for judgment is 
available from internal, proprioceptive gravi- 
tational cues. In the present study, the Re- 
actors proved to be more affected by the ex- 


TABLE 1 
Q Items DISTINGUISHING BETWEEN THE REACTOR 
AND NONREACTOR GROUPS 


Characteristic of Reactors: 

10. His anxiety and tension find outlet in 05 
bodily symptoms and dysfunction; 
anxiety and tension are converted 
into somatic symptoms 


11. Is protective of those close to him .05 

14. Is readily dominated by others; sub- .05 
mussive 

19. Seeks reassurance from others 05 

30. Gives up and withdraws where pos- 05 
sible in the face of frustration and 
adversity 

40. Is vulnerable to real or fancied threat; 05 
generally fearful; is a worrier 

70. Behaves in an ethically consistent -05 
manner 


74. Lets other people take advantage of 


him; allows exploitation 05 
Characteristic of Nonreactors: 
94. Expresses his hostilities directly -O1 
96. Values his own independence and .01 
autonomy 
1. Critical, not easily impressed, skeptical .05 
62. Tends to be rebellious and non-con- .05 
forming 


ternal frame of reference than were the 
Nonreactors (p < .05). That is, the Reactors 
were more externally oriented and more open 
to outside influence in the presence of uncer- 
tainty than were the Nonreactors, who lent 
greater credence to their proprioceptive cues. 

A rather curious result emerges from a com- 
parison of the behavior of the two groups in 
the Crutchfield independence-of-judgment ex- 
perimental procedure (1). Here the subject is 
exposed to a group consensus about a problem 
which differs from what the S perceives as the 
correct or preferred solution. The extent to 
which S conforms to the group opinion in his 
manifested choice provides a measure of inde- 
pendence found by Crutchfield to be richly re- 
lated to character structure. 


TABLE 2 


ADJECTIVES DISTINGUISHING BETWEEN THE REACTOR 
AND NonrEACTOR GROUPS 


(.05 level of significance) 


Characteristic of Reactors: 


cautious idealistic 
dependent mannerly 
dreamy suggestible 
Characteristic of Nonreactors: 
clever leisurely | 
cool opportunistic 
evasive practical 
independent realistic 
ingenious 
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In the present analysis, Reactors and Non- 
eactors did not differ in amount of independ- 
nce as expressed by their choice behavior. 
But subjects also expressed their subjective 
-onfidence in their judgments. The Reactors, 
although actually conforming no more than 
Nonreactors, proved to be less confident about 
their course of action (p < .001). Independ- 
ence (or conformity) in the Reactor thus ap- 
pears to have a different psychological base 
than in the Nonreactor. It might even be said 
that manifesting independence in the face of 
uncertainty and doubts, as did the Reactors, 
required more courage than equivalent be- 
havior where subjective doubt does not pre- 
vail, as among the Nonreactors. 

Although the classification into Reactors 
and Nonreactors was based solely upon the 
evaluation of the GSR, some evidence for the 
physiological generality of reactivity may be 
adduced from a subsequent finding. Following 
a stressful psychodramatic situation (7), it was 
found that the systolic blood pressure of Re- 
actors was raised significantly more than that 
of the Nonreactors (p < .01), 


Discussion 


The present results, taken together with 
those of Jones and others, provide reasonable 
evidence that electrodermal responsiveness is 
related to personality structure and behavior. 
While in both studies, the number of significant 
relationships is small considering the multiple 
tests of significance, the consonance of both 
sets of data argues for their psychological sta- 
bility. The placing of these data within a con- 
ceptual frame of reference is another matter, 

An interpretation of the findings in terms of 
the notion of externalization-internalization 
simply does not seřve. The externalizing-in- 
ternalizing dimension has been only informally 
conceptualized, but it appears to be concerned 
with whether need tensions are discharged via 
external, action modes of expression or whether 
needs are routed internally into cognitive and 
visceral channels of discharge. Implicit here is 
the assumption that there exists a reciprocal 
relationship between overt motor response and 
cerebral or autonomic activity. That is, overt 

motor response displaces internal activity and 
vice versa. Jone’s finding, supported by Sey- 
mour (12), that there are “generalizers,” in- 
dividuals whose expression of need states is 


simultaneously overt and covert, upsets such 
a notion. Observations of a culture like the 
Italian, where emotion is vigorously displayed 
but also apparently internally experienced, also 
weigh against the reciprocal theory of externali- 
zation-internalization. Clearly, an additional 
dimension or a reformulation of the concept is 
required in order to encompass the various 
patterns of emotional expression which have 
been noted. 

It is suggested here that an important dis- 
tinction is being obscured in the definition of 
externalization-internalization by opposing 
overt motor response on the one hand against 
cognitive and visceral channels of discharge on 
the other. The italicization of and is important, 
for it is precisely this conjunction which may 
be inappropriate. From an instrumental point 
of view, cognitive activity and motor behavior 
are both means to drive reduction, whereas 
autonomic, visceral reaction serves no such 
function. Autonomic activity appears instead 
to be an expression of an affective state and 
does not in and of itself bring about a signifi- 
cant reduction of the disturbing emotional 
charge. Tension reduction is brought about 
only by appropriate cognition, fantasy, or 
motor behavior, not by autonomic arousal. 

If this reformulation appears a tenable one; 
then externalizing-internalizing can be rede- 
fined as a way of characterizing the direction oF 
orientation of an individual’s response to the 
impelling affective disequilibrium. GSR labil- 
ity, from this point of view, is simply a con- 
venient index of the tensional state, of whether 
an individual has anxiety or affect in a given 
situation or. whether he does not have anxiety 
or emotion in that situation. It has no impli- 
cation, however, for the direction of the re- 
sponse precipitated by the emotion. Since affec- 
tive responsiveness may eventuate in motor 
as well as cognitive behavior, the dilemma 
previously posed by the “generalizer” no 
longer exists. 

The dimension distinguishing between the 
Reactors and Nonreactors in the present study, 
may be the dimension of affective responsive- 
ness without regard for just how the individual 
copes with emotional states. There has been 
little direct investigation of affective respon- 
siveness although it is known that there are 
great differences in the extent to which indi- 
viduals experience affect and have their self- 
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percept brought forward into consciousness as 
a conditioner of behavior. Some individuals 
experience too much affect, e.g., anxiety neu- 
rotics, and are thereby incapacitated. But 
others do not experience sufficient emotion, 
sufficient in relation to a normative criterion, 
and therefore do not react on the basis of an 
implicit but culturally shared set of behavioral 
premises. Because they operate from a differ- 
ent premise system, these individuals are more 
likely to emit behavior which is inconceivable 
to the affectively responsive observer. Such 
people include the primary psychopath, cer- 
tain forms or stages of the schizophrenic reac- 
tion and, in general, those individuals identi- 
fied by psychiatrists as possessing “flattened 
affect.” 


SUMMARY 


Seventy Ss, during the course of an assess- 
ment program, experienced a lie-detection situ- 
ation where GSR reactions were recorded. The 
twenty most reactive subjects and the twenty 
least reactive subjects were selected and com- 
pared on a variety of measures. Reactors 
appeared to be more dependent, dreamy, ideal- 
istic, and suggestible; Nonreactors were evalu- 
ated as relatively cool, evasive, opportunistic, 
and independent. The findings were related to 
the previous work of Jones, and a reformula- 
tion of the notion of externalization-internali- 


zation was offered. 
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DIATE AND DELAYED EFFECT OF FAILURE AS A 
iig IMME PUNCTION OF TASK COMPLEXITY AND 
PERSONALIZATION OF FAILURE 


CHARLES B. TRUAX anp BARCLAY MARTIN 


University of Wisconsin 


REVIOUS research has suggested various 
factors as being important in deter- 
mining the effects of experimentally 

induced stress upon performance. This study 
is concerned with the manipulation of several 
of these factors within the framework of one 
experimental design. There is ample evidence 
that individual differences in general anxiety 
level, whether measured by paper-and-pencil 
tests or assessed by psychiatric diagnosis, are 
related to performance under stress conditions 
(1, 3, 4, 5, 9, 14, 15). The typical finding in this 
respect is an interaction of anxiety level by 
stress, so that high anxious subjects (Ss) tend 
to do less well under stress than under neutral 
conditions, and low anxious Ss tend to perform 
relatively better under stress than under neu- 
tral conditions. 

This finding, however, cannot be accepted 
without qualification, and it was essentially 
the purpose of this study to investigate some 
of the factors which may affect this interaction 
of anxiety level and stress. One such factor is 
the level of complexity of the task being per- 
formed or learned. Studies have generally indi- 
cated that the more complex the task, the 
more poorly the high anxiety Ss do in relation 
to the low anxiety Ss (2, 7, 10, 12, 13). Thus, 
in the present experiment two levels of com. 
plexity have been introduced. 

Another factor which may influence the 
effects of failure (as one form of stress) on 
performance is the length of the time interval 
interposed between the failure and the post- 
failure performance. This variable has usually 
been described in terms of the immediate 
versus delayed effects of failure, and previous 
findings are somewhat conflicting. Russell (8) 
found failure conditions to be associated With 
poorer retention scores than nonfailure condi- 
tions under immediate, but not delayed (24 
hours), recall conditions. The task used was a 

list of 12 nonsense syllables learned by the 
seria] anticipation method. Farber, Russell, 
and Andreas (1), using a paired-associate ver- 
bal learning task, found that failure adversely 
affected performance on immediate tests of 
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retention; but on delayed tests, the failed 
group tended to perform better than the con- 
trols. Only the decrement under the immediate 
testing conditions was significantly different 
between the failed and nonfailed groups how- 
ever. 

On the other hand, Zeller (16, 17) found 
both an immediate and delayed (72 hours) 
effect of failure relative to nonfailure condi- 
tions, using both a recall and a relearning score 
based on a paired-associate task. One clue 
suggesting an explanation for the discrepancy 
between Zeller’s study and the others men- 
tioned above may be found in the nature of 
the failure comments given. Zeller’s failure 
remarks were without a doubt extremely severe 
and were to a considerable extent personally 
insulting to S. The comments in the Russell 
and the Farber, Russell, and Andreas studies, 
while without question indicating to S that he 
failed, were clearly much less severe and per- 
sonalized. It may be, then, that delayed effects 
of failure become apparent only under rather 
strongly personalized conditions. In other 
words, failure experiences which are explicitly 
made to reflect upon the general competence 
and integrity of S may have longer lasting 
effects than those which imply only failure on 
the immediate performance. 

To explore these possibilities, immediate and 
delayed postfailure testing, mild and severe 
Personalization of failure, two degrees of task 
complexity, and subgroups divided at their 
medians on the basis of either the Taylor 
Manifest Anxiety Scale or the Stroop Interfer- 
ence Test were all incorporated as variables 
within the experimental design. The Stroop 
test (11) was included because it seemed likely 
that it might give some measure of a person’s 
general susceptibility to interfering response 
tendencies, a trait that Waterhouse and Child 
(14) have suggested as being closely related to 
quality of performance under stress. 


METHOD 

The Ss were 128 fem 
psychology course at t 
teen Ss were random] 


ale students in the introductory 
he University of Wisconsin. Six- 
y assigned to each of 8 treatment 
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ela two different tasks involved different 
‘ariables, the results had to be analyzed 
separately; accordingly, the s caer ra 
as consisting of wey he study can be described 
designs. Thus, f Wo separate two-by-two factorial 
à ; S, for the simple task condition, there 
were two levels of pers silisati F iip 
of failure orthogons i sonalization (mild and severe) 
postfailure testin a 9 two intervals between pre- and 
complex task bie ha seconds and 24 hours). In the 
The individual eee a similar design was used. 
dividing the sh erence measure was introduced by 
Taylor Manifest eros at the median on either the 
ence Test ada : nziety Scale or the Stroop Interfer- 
torial deim Sine an additional dimension to the fac- 
several Se py mui Taylor scores were not obtained on 
N of ta Hl m necessary to reduce all subgroups toan 
A orir to retain cell proportionality for this 
Taylor Ree 1e were used for the Stroop test. The 
Saa aa est Anxiety Scale was administered to the 
dividas hee their regular class period before the 
al testing session. 
ee Stroop Interference Test was given at the begin- 

g Of the individual session. This test consists of three 
Parts: Card A, a page on which only the four color 
names of red, blue, yellow, and green are printed in 
black ink in a random arrangement of eleven rows of 
ten words each; Card B, a page on which the same colors 
appear in 34 inch colored dots, arranged randomly in 
the same format as Card A; and Card C, a page on 
which the same colors are printed in random order, with 
cach color name printed randomly in a color other than 
the one it designates. The 5’s task was (a) first to read 
as fast as possible the color names on Card A, (b) to 
name as fast as possible the colors of the color dots on 
Card B, and (c) to name as fast as possible the colors on 
Card C in which the words were inked, ignoring any 
interference produced by the “wrong” names. The 
score used was the ratio of the time to finish Card B to 
the time to finish Card C. 

The simple task. The simple task consisted of S’s 
adding 78 pairs of one-digit numbers. The score was 
the time in seconds to complete the problems. Uncor- 
rected errors on this extremely simple task were not 
frequent enough to affect the time score. The same task 
was used in the postfailure testing. 

The complex task. The complex task consisted of the 
Arithmetic subtests of the Wechsler-Bellevue Intelli- 
gence Scale, Forms I and II. One form was presented 
under prefailure, and the second form was given under 
postfailure conditions with order counterbalanced 
within each of the 8 major experimental groups. The 
weighted score derived from the Wechsler-Bellevue 
standardization norms, involving both time and accu- 


racy measures, was used. 
The failure inducing 


task. Failure comments were 


given for a totally different task so that all Ss would 
experience failure in a common situation. This task was 
an extended version of the Wechsler-Bellevue Digit 
Symbol Substitution Test. It was presented on a dittoed 
sheet with the label, “Wisconsin Digit Symbol Test of 


Intellectual Ability, College Form AS” prominently 
displayed across the upper third of the page. The time 
allotted for the completion of this task was varied with 
S’s speed to insure that S would complete only three- 
fourths of the test. L. 
Procedure and instructions. The following instruc- 
tions were given initially for the Stroop Interference 


™, 
test: 
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We are in the process of obtaining normative data 
on a test which we hope will be a good measure of 
Reading Ability. This test is in three parts, and we'll 
start off with the first part now. 

The Stroop test was then administered, after which S 

was told, 

As you may know, in standardizing a test you 
have to get other measures on the subjects in order 
to check for a normal distribution. For example, if 
all of our subjects were exceptionally intelligent or 
quick reacting, the Reading Test wouldn’t be very 
good for estimating the ability of the average college 
student. So we have some other tests for you to take. 
The simple or complex task was then administered, 

depending on the treatment group to which S was 

assigned. After this task, the following “explanation” 
was given of the failure inducing task: 

Reading ability, as you probably know, is related 
to IQ, so to standardize the Reading Test for a 
college population, we will have to get a measure of 
the IQ of the subjects used in this study. We will use 
the Wisconsin Digit Symbol Test. It’s one of the 
shortest IQ tests available, but it’s still very reliable. 
The average college student will get about 114; the 
brightest student so far in this study has got 129 
items right. 

Don’t expect to finish the test entirely in the two 
minutes you will have—just do the best you can. 
After the failure inducing task was administered, 

experimenter (Æ) laid a template over the test and pre- 

tended to score it. The following comments (which had 
previously been memorized by Æ) were then made in 


the “severe personalization” group: 
Damn! (E frowns and shakes head). Ninety-two. 


You really goofed this test..... (S’s first name). 
You’re well below the norm of 114. (Pause.) You 
having much trouble keeping up with your classes? 
Well, either you were just goofing up this experi- 
ment.....(S’s first name), or something’s wrong 
with you. A score of 92! Damn!!! 

The following comments (which had previously been 
memorized by Æ) were made in the “mild personaliza- 
tion” group: 

Ninety-two. (E pauses and continues in a moder- 
ate tone.) The score is 92. (Pause.) This is below the 
norm and not very good. (Pause.) A score of 921 
E paced the mild and severe personalization remarks 

so that the same amount of time was taken in giving 
both, about 17-18 seconds. 

After the induction of failure, half the Ss were re- 
tested immediately on the performance task (either 
simple or complex), and an appointment was made for 
the other half to meet for a second session 24 hours 
later. All Ss were given the following instructions 
before the administration of the postfailure test: 

Well, let’s do this arithmetic (addition) test over 
again. Remember, I’ll be timing you, so work as 
quickly as possible without making mistakes. 

At the conclusion of the experiment, each S was told 
the purpose of the study and allowed to vent her feelings 


about the failure experience. 


RESULTS 


re was a negative correla- 


i k. The 
Simple tas the simple task 


tion of —.68 between scores on 
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under the prefailure condition and the change 
score from pre- to postfailure. Also, by chance 
apparently, the mild and severe personalization 
groups differed greatly in their prefailure 
scores. Accordingly, analysis of covariance was 
used, making it possible to analyze only that 
variance in the change scores that was uncor- 
related with the prefailure scores. The variance 
within the different treatment groups did not 
differ significantly as indicated by Bartlett’s 
test for homogeneity of variance. 

It should be mentioned, before these anal- 
yses are considered, that a previously per- 
formed analysis of variance with repeated 
measures indicated a significant tendency (p 
< .001) for all Ss to improve their performance 
on this simple task after failure. This general 
tendency would not appear, of course, in the 
analysis of covariance in which change scores 
were used; but it indicates that any found 
effects will represent differential tendencies to 
improve from pre- to postfailure conditions. 

The Stroop test did not prove to be signifi- 
cantly related to change in performance on 
either the simple or complex tasks, and results 
with respect to this measure were not further 
considered. 

Table 1 presents the analysis of covariance 
with the subgroups divided into high and low 
Taylor scale scores. The main effect of the 
immediate-delay variable was significant in 
the direction of greater improvement in per- 
formance under the delayed postfailure testing. 
Individual differences in anxiety were also 
significantly related to change in performance. 
High anxious Ss showed greater improvement 
(p < .01) than low anxious Ss, and there was 
an interaction of anxiety with mild-severe 


TABLE 1 


ANALYSIS OF COVARIANCE FOR THE SIMPLE 
Task USING THE TAYLOR SCALE 


Source df gemi F 

Taylor scale (A) 1 | 110.94] 7.25** 
Personalization (B) 1 1.87 — 
Immediate-Delay (C) 1 76.80 | 5.02* 
AXB 1 | 203.44 | 13.24*** 
AXC 1 41.89 | 2.74 
BXxC 1 15.47 | 1.01 
AXBXC i | 122 — 
Within groups 47 15.31 = 

*p <05. 

p< 01. 

#22 p < 001. 


TABLE 2 


ANALYSIS OF COVARIANCE FOR THE COMPLEX 
TASK USING THE TAYLOR SCALE 


| Mean 

Source df aon F 
Taylor scale (A) | 1 9.05 2.86 
Personalization (B) | 1 0.01 | = 
Immediate-Delay (C) 4 30.93 | 9.76** 
AXB il we | = 
AXC 1 | 005 | — 
BXC i 1.87 — 
AXBXC 1 12.09 | 3.82 
Within groups 47 3.17 | 


“p< 01. 


personalization (p < .001), suggesting that 
high anxious Ss improved less under severe 
personalization than under mild, whereas low 
anxious Ss improved more under severe than 
under mild personalization. 

This interaction was further investigated by 
performing an analysis of covariance upon the 
high and low groups separately. The low anx- 
iety group was found to perform significantly 
better under severe than mild personalization 
(p < .05), but there was no evidence that the 
high anxiety group performed better under 
mild than severe personalization. The interac- 
tion effect was apparently confined to the 
lower portion of the anxiety distribution. 

Complex task. A repeated measures analysis 

of variance revealed no significant tendency 
for Ss in general to change in performance 
from pre- to postfailure on the complex task. 
Initial scores correlated with change scores 
—.54, so analysis of covariance was also used 
here. Table 2 presents the analysis of covari- 
ance for subgroups divided on the basis of the 
Taylor scale. The immediate-delay variable 
was Significant (p < .01), with Ss again show- 
ing more improvement in performance under 
the delayed postfailure testing. No other 
variables were significant. 
_ It is sometimes reported that variability 
creases as a function of failure (6). The 
variability in the prefailure situation was com- 
pared to that in the postfailure situation for 
both simple and complex tasks. Changes were 
small and insignificant in both cases; the F 
values were, respectively, 1.19 and 1.02. 

The differences in results between the simple 
and complex tasks do not appear to be due to 
differences in the extremeness of groups based 
on the Taylor scale. Table 3 presents the mean 
Taylor scale scores for the groups and their 
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TABLE 3 
MEANS AND PERCENTILE RANKS OF GROUPS 
ON TAYLOR SCALE 


č Simple Task Complex Task 
roup 

Mean P.R. Mean P.R. 
High anxiety 23.22 86 20.95 80 
Low anxiety 9.72 32 10.00 34 


percentile rank as estimated from the Univer- 
sity of Iowa normative data. 


DISCUSSION 


No main or interactive effects were found 
when the Stroop Interference Test was used 
as the individual difference measure. These 
negative results suggest that anxiety is either 
not fruitfully conceptualized in terms of a 
general susceptibility to interfering responses 
as Waterhouse and Child (14) suggest or that 
the Stroop test is not a valid measure of this 
trait. 

When the Taylor scale is used to assess 
individual differences, significant relationships 
were found with pre- to postfailure perform- 
ance changes for the simple but not for the 
complex task. This suggests that the Wechsler 
Arithmetic Subscale may be relatively less 
sensitive to the effects of failure than the 
simple additions task, although there was a 
tendency for performance on the Arithmetic 
subscale to be affected by the immediate-delay 
variable. Both tasks involve relatively well 
learned past habits, and this common aspect 
of both tasks is probably not particularly 
susceptible to the effects of failure and general 
anxiety level. The simple additions test, how- 
ever, is much more predominantly a speed 
test, and it may be that its relationship to the 
failure and anxiety variables is essentially a 
function of the speed factor, which seems much 
less important in the Wechsler problems. 

The significant effects of the immediate- 
delay variable for both the simple task and 
the complex task suggest that failure has a 
greater effect after a 24-hour delay than under 
immediate retesting conditions. Nevertheless, 
this interpretation requires considerable cau- 
tion since, first, it may hold only for conditions 
in which performance improves after failure, 
and second, no control for practice effects was 
available in this experiment. With regard to 
the latter consideration, however, the con- 
tention would have to be that practice has a 


more beneficial effect after a 24-hour delay 
than upon immediate retesting, which seems 
possible but rather improbable. 

The results of this study are in some re- 
spects consistent with those of Farber, Russell, 
and Andreas (1), who found that a failed 
group did worse than a nonfailed control group 
under immediate retesting conditions, but 
tended to do better than the control group 
under delayed retesting. Russell (8) also found 
the negative effects of failure to be greater in 
the immediate than in the delayed situation. 
With the present results, these findings suggest 
that the detrimental effects of failure are more 
pronounced in the immediate than in the 
delayed situation, whereas the facilitating ef- 
fects of failure are more pronounced in the 
delayed than the immediate situation. 

For the simple task, it was found that high 
anxious Ss showed greater improvement after 
failure than low anxious Ss. This result appears 
to contradict the usual findings. Williams (16), 
employing a similar task of simple addition 
problems, but using the Waterhouse and Child 
Interfering Tendency Questionnaire to select 
high and low groups, found an interaction in 
the opposite direction (p < .10), with high 
scorers performing about the same after non- 
failure as after failure conditions. Waterhouse 
and Child (14) report similar interactions with 
somewhat different kinds of tasks. 

However, if one introduces the variable of 
mild-severe personalization of failure, the re- 
sults of the present study no longer seem so 
contradictory of previous findings. The low 
anxious Ss show more improvement under 
severe than mild personalization of failure, 
whereas the high anxious Ss do not show this 
differential improvement. The resulting inter- 
action of anxiety with mild-severe personaliza- 
tion therefore seems to represent a psychologi- 
cally meaningful process. Thus, it looks as 
though the performance of high anxiety Ss on 
this relatively simple task is not adversely 
affected by either mild or severe failure. On the 
contrary, their performance is comparably 
facilitated by the additional anxiety or moti- 
vation under both conditions. The low anxiety 
Ss, however, show a greater improvement 
under severe than mild personalization. It may 
be that high anxiety Ss are approaching an 
optimum state of tension in the mild condi- 
tions and, accordingly, further stress does not 
enhance performance; but the low anxious Ss, 
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having not yet reached the optimum level of 
tension, continue to improve in performance 
under severe personalization. This finding is 
consistent with the general hypothesis that 
adequacy of performance as a function of 
anxiety is a curvilinear-function of an inverted 
U-shape type. The fact that the high anxiety 
group has apparently not gone beyond the 
optimum state of tension to the point where 
performance begins to be disrupted may be 
due to the extremely simple nature of the 
one-digit additions task. This interaction of 
anxiety level with degree of personalization 
also indicates the importance of reporting in 
detail the operations defining the failure in 
any given piece of research, since this variable 
seems to affect performance significantly. 


SUMMARY 


This experiment was designed to study the 
effects of several variables upon changes in 
performance from pre- to postfailure condi- 
tions. All Ss were female college students. For 
one group of Ss performance was measured 
on a simple task, one-digit addition problems; 
for another group of Ss, performance was 
measured on a more complex task, the Arith- 
metic subscale of the Wechsler-Bellevue. A 
factorial design was used for each of these 
groups with the independent variables being 
mild-severe personalization of failure, immedi- 
ate-delay of postfailure retesting, and high and 
low groups based on either the Stroop Inter- 
ference Test or the Taylor scale. 

All Ss tended to improve in performance 
after failure on the simple task. High anxious 
Ss tended to improve more than low anxious 
Ss. The interaction of anxiety with mild-severe 
personalization was significant with low anx- 
ious Ss tending to improve more under severe 
than mild personalization, and high anxious 
Ss improving about equally under both condi- 
tions. The immediate-delay variable was sig- 
nificant with Ss showing greater improvement 
after the 24-hour delay. 

The only relationship found, when perform- 
ance on the complex task was used as the de- 
pendent variable, was a tendency for the Ss to 
improve more after the 24-hour delay than 
upon immediate retesting. 

The findings in regard to the immediate 
delay variable suggest the tentative hypothesis 
that the facilitative effects of failure are greater 


after a delay, but the detrimental effects are 
greater when retesting is done immediately. 
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USES OF COMMON OBJECTS AS INDICATORS 
ORIENTATIONS! 


WAYNE DENNIS 
Brooklyn College 


F children are asked with regard to each of 
several objects, “What is it for?” one 
AA finds that children’s concepts of the func- 
tions of identical things vary from society to 
society. Those functions of common objects 
which are frequently mentioned by the chil- 
dren of a social group, it is here proposed, are 
indicative of the activities, interests, and values 
that prevail in that society. That is, it is held 
that children value things for the uses to which 
they may be put, and, conversely, that fre- 
quently-mentioned uses are indicative of 
values. For example, if children in one society 
often mention boys and girls, cats and dogs, 
trees and sand, as functioning chiefly in play 
whereas the children of another group mention 
them more often in connection with work, 
these facts suggest that there is a difference 
between the two groups in regard to work-play 
emphases. The present paper illustrates the 
application of this method, The Uses Test, 
with three groups of children, an American 
group, a Lebanese group, and a Sudanese 
group. , : 

If the objects used in questioning children 
are universally present, this test procedure can 
be employed in all cultures. Techniques for the 
study of child behavior that are universally 
applicable are very small in number. The 
method employed in the present study is of- 
fered as a useful addition to the small stock of 
tools available for cross-cultural research. It is 
believed to be particularly useful as an explora- 
tory technique in societies whose patterns of 


1 The research here reported was done during 1955- 
56 when the author was a visiting professor at the 
American University of Beirut, Lebanon. Expenses of 
the study were defrayed by a grant of the Rockefeller 
Brothers Fund to the American University of Beirut. 
The writer wishes to express his gratitude to Mrs. 
Adele Hamdan Taky Din, Miss Leila Biksmati, Mrs. 
Marie Therese Broussalian, Mrs. Yvonne Sayegh and 
Miss Aimee Sabbagh who served as research assistants, 
and to the principals, teachers, and pupils whose coop- 
eration made the study possible. He is also greatly 
indebted to staff members of the departments of psy- 
chology and education, and to members of the child 
development seminar at the American University of 
Beirut, who gave invaluable assistance and counsel 
throughout the course of the investigation. 
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child behavior are but 
investigator. 


METHOD 


The question employed is the simple and probably 
universal one, “What is for?”. Naturally, this 
question may be asked in regard to any plant or animal, 
any class of persons, or any part of the body. 

The question, “What is for?,” was chosen 
because numerous studies indicate that the young child 
readily thinks in terms of use. He spontaneously asks 
what things are for, he defines objects in terms of their 
use, and he seems to presume that every object in the 
world has a use. No difficulties have been encountered 
in obtaining answers to the question from hundreds of 
children between ages five and ten in a variety of cul- 
tures. In all likelihood it can be used above and below 
the age levels represented in this study. The technique 
is not limited to the particular word list employed here. 
The objects concerning which children are questioned 
will vary with the interests and hypotheses of the in- 


vestigator. 
In the present study, the following words were used 


in the order given: the mouth, the hands, a mother, a 
father, a boy, a girl, trees, wood, dogs, cats, birds, 
stones, the sun, sand, rain, and gold. In the case of the 
American and Lebanese subjects, data were gathered 
by individual interview. The Sudanese subjects were 
instructed orally as a group, and they were requested to 
write their answers. 

Although the question put to the child did not ask 
for more than one use for each object, some children 
gave more than one. In the present study, responses 
beyond the first were not included in the analysis of 
the data, unless specific reference is made to answers 


beyond the first one. 


Subjects 


American. These were American children attending 
the American Community School in Beirut, Lebanon. 
The majority of these children were born in the United 
States and spent their pre-school years there. They were 
in Lebanon in 1955-56 because their parents were tem- 
porary residents of that country. Most of the fathers 
were employed by the University, or American busi- 
ness firms. Nearly all could be classified as belonging to 
the American middle class. All children between ages 
5.0 and 11.0 were tested. These totaled 120 children, 
approximately equally representing the two sexes and 
the various ages. 

Lebanese. This group consisted of an analytic sample 
of native-born children in Beirut, Lebanon. The sample 
was divided into 36 cells, produced by cross classifying 
in terms of four criteria: sex, private schools vs. public 
schools, age (5-6 years, 7-8 years, 9-10 years) and 
religious community (Arab-Christians, Moslems, and 
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Armenians). With 25 children in each cell, the sample 
comprised 900 subjects in all. All children of appropri- 
ate ages were interviewed in each of several schools, the 
pupils being subsequently assigned to cells. If more 
than 25 so obtained Ss fell into a given cell, the num- 
ber was reduced to 25 by discarding the data from Ss 
by chance. 

woes This group consisted of 13 girls and 45 
boys, all aged 9 and 10, from a private school in Om- 
durman, the capital of Sudan. Unlike the others, these 
children gave their answers in writing (in Arabic). 


Treatment of Results 


For the answers to each question, categories (to be 
described below) were established to which the majority 
of replies could be assigned. Each answer was assigned 
to only one category. Answers that did not fall into any 
of the defined categories are not represented in the 
tables of results. 


RESULTS 


The percentage of each group giving each 
type of answer to each question is indicated in 
Table 1. The frequencies for the Lebanese sub- 
groups are shown in Table 2. 

The major group contrasts in regard to each 
question will be discussed in the order in which 
the questions are listed in the tables. In gen- 
eral, only group differences high in statistical 
significance are noted. In Statements assessing 
the significance of differences, A-L refers to an 
American-Lebanese comparison, L-S refers to 
a Lebanese-Sudanese comparison and A-S to 
an American-Sudanese comparison. 


The Mouth 


According to the subjects, the primary uses of 
the mouth are eating and talking. These two an- 
Swers make up 100%, 96%, and 92% of the first 
responses to question 1 for the three groups re- 
spectively, 

“Talking” as a response to this question is 
given more frequently by the Americans than by 
either of the other groups. Table 2 shows that 
none of the Lebanese subgroups equals the Amer- 
icans in this regard. Conversely, eating is a 
response given more often by all Segments of the 
Lebanese sample than by the Americans, 

The Sudanese are the only Ss to mention 

drinking as the first response to the question “What 
is the mouth for?” Not shown by Table 1, because 
it deals only with first-mentioned uses, is the fact 
that among the Sudanese children a very common 
response to the question, “What is the mouth 
for?”, is “eating and drinking.” This joint mention 
of eating and drinking occurred among 50% of the 
Sudanese, but among only 3% of the Americans 
(S-A p < .001) and none of the Lebanese. The 
frequent mention of drinking in Sudan is probably 
related to the high temperatures prevailing there, 
which require a high water consumption. 


It is noteworthy that many possible uses of the 
mouth were not mentioned by any child in any of 
the three groups. Many possible replies were ab- 
sent not only as first responses but failed to appear 
among subsequent responses as well. Among such 
absent responses were singing, chewing, spitting, 
shouting, biting, pointing, smiling, laughing, 
crying, coughing, sucking, blowing, whistling, 
praying, and cursing. It seems likely that in so- 
cieties other than the three which were investi- 
gated one or more of these would be common word 
associations with the mouth. 


Hands 


For all three groups of children, the uses of the 
hands are more diversified than the uses of the 
mouth. The most common Tesponses to the ques- 
tion, “What are hands for?” are to hold things, to 
grasp, to work, to carry, to touch with, to eat with, 
and to write with. Only the frequencies of eating, 
writing, working, and playing are examined below. 

Eating as something for which the hands are 
used is mentioned much more often by the Leba- 
nese and Sudanese (by 23% and 33% of the re- 
spective groups) than by the Americans (6%) 
(p of A-S and A-L < -001). These figures no doubt 
reflect differences in eating customs among the 
groups, holding food with the hands being common 
among some groups of the Lebanese and Sudanese. 
Among the Lebanese reference to use of the hands 
in eating is made more often by the government 
school pupils who in general belong to a lower 
economic group than by the private school pupils 
(Table 2). 

In regard to writing and working, the response 
frequencies of the three major groups are almost 
identical. Play is mentioned most frequently by 
the Americans, seldom by the Lebanese, not at all 
by the Sudanese. Other Tesponses are of such a 
miscellaneous nature as to be difficult to treat in 
a comparative manner, 


Mother 


2 


In all groups, Providing food and care for chil- 
dren comprise the major functions of a mother in 
the eyes of the child. Providing food is most often 


Although the general response, “she takes care 
of us,” was common, in all groups there is an al- 
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Percentage Frequencies 


TABLE 1—Continued 


Percentage Frequencies of 
Responses 


Stimulus a 
Words and Use—Categories American Lebanese Sudanese 


P of Responses 
Stimulus 
Words and Use—Categories American Lebanese Sudanese 
Mouth 
eating 61 74 yi 
talking 39 22 14 
drinking 0 0 7 
Hands 
eating 6 23 33 
writing 11 10 16 
working 12 12 12 
playing 4 1 0 
Mother 
providing food 16 51 7 
providing care 62 44 41 
nursing 0 0 19 
Father 
working, earning money 55 76 10 
buying food 1 16 7 
providing care 12 6 16 
assisting family 14 3 29 
Boy 
going to school 16 37 7 
working, helping par- 23 24 17 
ents 
future reference il 20 12 
playing 34 9 0 
a hool 12 33 9 
going to schoo. 
working, helping par- 28 45 34 
ents 
future reference 12 11 f 
playing 27 5 
o 4 o 2 
‘ood 
d 8 8 10 
woo 2 
shade 10 2 1 
climbing 12 1 0 
bi 43 33 5 
building 34 17 14 
furniture 12 33 a 
gates 0 0 
Dogs 7 
guarding, watching s g og 
barking, biting 4 8 7 
hunting | 49 ri 3 
pets, playing 
Cats 53 
catching mice 16 e 2 
meowing 2 6 7 
pets, playing 5 
Birds 
i rÀ 37 16 
Due singing 34 36 36 
R 8 33 8 2 
njoying 
Stones 
; seg 19 13 2 
throwing, hitting an 73 53 


building 


sm 0 9 2 
drying 
warming 25 28 9 
light 53 47 67 
sm i 41 18 2 
playing 
building 21 58 26 
Rain 
growing plants 75 46 41 
providing water 19 16 12 
Gold 
decorative 16 53 67 
economic 61 26 2 


most complete absence of reference to specific 
motherly acts other than feeding. No child men- 
tioned sweeping, washing dishes, laundering, 
ironing, sewing, dressing, washing faces, bathing, 
shopping, putting to bed, arousing, getting off to 
school, or the superintending of these activities, 
although some of these would probably have been 
mentioned if as many responses as was possible 
had been obtained to each question. Also unmen- 
tioned are helping with lessons, moral instruction, 
reward and punishment, dispensation of justice 
within the family, love and affection, story-telling, 
and parent-child play. 


Father 


If the child’s point of view is accepted, the life 
of a Lebanese father must be a dreary one. He 
works and earns money in order to support the 
family. This is the general tenor of all answers. As 
in the case of the mother, there is no mention of 
his playing with the child, rewarding or punishing, 
or of expressing love or hate. So far as the manifest 
content of the responses are concerned, the Leba- 
nese child seems to have a father who provides for 
him but who otherwise has little relationship to 
him. The answers of the American children are 
similar, but there is a much larger miscellaneous 
category. In contrast to the Lebanese and the 
Americans, the Sudanese child mentions the 
working and earning functions of his father very 
seldom. Only 10% of the Sudanese answers fall 
into this category as compared to 55% and 76% of 
the Americans and Lebanese respectively. As- 
sisting the family and taking care of the family are 
common answers among the Sudanese but are 
infrequent in the other two groups. 


Boy and Girl 


“What is a boy for?” “What is a girl for?” 
Nearly every answer to these questions can be 
placed in one of four categories: The function ofa 


TABLE 2 
COMPARISON OF LEBANESE SUBGROUPS 


Sfwdis Percentage Frequencies of Responses 
> is- g ir | Pri bli | 9-10 
Words and Use—Categories lems | Gans | Alans | Boys | Girls Schoos | Schouis | 56 yrs:| 7-8 yrs: yrs. 
oo 81 80 70 79 75 73 75 79 82 70 
talking 18 19 29 20 24 27 17 19 17 31 
arn 2: | 2e | | a | | as | oo | a | ae | 40 
writing 10 14 10 9 13 15 5 4 13 16 
working 22 18 13 11 14 12 12 7 16 27 
pee 1 1 1 0 1 1 3 2 1 
Mother z 
Taine food SS | 42 | 57 | 55 | 47 | 44 | so | 65 | ss | 34 
rine oe 41 | 53 | 39 | 39 | 49 | 54 | 34 | 23 | 33 | oa 
Father á 
a= ë jglgjsieigiglyigigig 
uying foo 
providing care 1 4 3 3 3 3 2 2 2 
assisting family 2 1 1 2 . 1 1 1 
OF cienbed 40 | 37 | 32 | 43 | 30 | 41 | 34 | 40 | 3g | 3 
working, helping parents 17 18 38 33 26 22 27 16 30 27 
future reference 15 28 14 17 23 20 20 8 19 33 
playing 8 9 9 8 9 9 18 5 3 
Girl 
going to school 30 36 32 32 33 35 30 40 33 25 
working, helping parents 43 40 49 47 44 41 50 28 44 59 
future reference 10 12 10 11 11 12 9 6 13 12 
playing 4 $ 6 6 5 6 S|! da 3 1 
Trees 
food 65 | 70 | 72 | 68 | 70 | 72 | 66 | 63 | 74 | 71 
wood 10 | 10 5 8 8 7 9 8 Tg 
shade 2 3 2 2 2 2 2 2 2 2 
climbing 3 1 1 1 2 1 2 3 1 0 
Wood 
fire 40 27 34 33 34 39 28 38 38 24 
building 21 22 16 17 17 16 21 16 13 
furniture 33 48 18 34 32 41 25 16 36 45 
Dogs 
guarding, watching 59 68 65 65 63 70 58 37 68 
barking, biting 17 | 14 | 25 16 | 21 13 24 | 40 11 s 
bunting 1 1 10 9 6 10 6 5 9 8 
pets, playing 3 5 3 3 5 6 4 1 
Cats 
catching mice 62 | 67 
meowing 13 13 = a a 4 3 33 69 82 
pets, playing 10 7 6 5 4 7 E É i 
Birds 
eating 40 47 24 
flying, singi 28 36 38 34 40 28 40 45 
Ai e ging i 7e 50 3 F 7 36 K s> 25 
Stouss = 
throwing, hitting 18 14 
Ping 66 71 72 7 g i 14 25 8 5 
roads 4 2 3 i 63 50 71 88 
1 5 5 3 2 
Sun 
drying 10 
© 11 9 9 
warming 32 26 6 12 10 11 5 
light 3 30 | 23 | 38 | 27 | 30 30 | 30 
8 | 46 | 57 | 52 | a2 | 35 42 | 34 | 50 | 37 


Sa 
my 
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TABLE 2—Continued 


Percentage Frequencies of Responses 


Stimulus 
i iv: i 9-10 
Words and Use—Categories Nos Chris Arme- Boys | Giris Private Public s6 yrsi| 1-8 ys 29 
Sand 
plaving 2 | 20 | 13 | 16 | 21 | 28 | 8 | 30 | 18 |7 
eee so | 63 | 60 | 66 | 50 | 51 | 64 | 44 | 58 | 72 
Rain x a 
growingiplants 4s | so | 43 | si | 42 | 63 | 30 | 25 
providing water 17 15 17 15 17 13 18 13 19 16 
Gold 
decorative co | s | sz | 39 | 72 | 49 | 02 | 49 | 32 j6 
economic 25 28 25 46 12 35 16 17 31 


boy (or a girl) is (a) to go to school, (b) or to work, 
help at home, assist mother and father, do chores 
and run errands, (c) or to grow up in order to do 
something in the future (become a man or woman, 
earn money when he grows up, etc., (d) or to play. 

Among the Lebanese, going to school comprises 
379% of the answers referring to the boy and 33% 
of the answers referring to the girl. Among the 
American and Sudanese groups school is mentioned 
much less frequently, although all of the Ss were 
attending school when questioned, Evidence from 
other studies, to be reported separately, indicates 
that schooling has a very high value among Leba- 
nese parents and children. 

Working (including the helping of parents) 
stands relatively high in frequency in all groups. 
In all groups working is mentioned more often as a 
function of the girl than as a function of the boy. 
Preparing for future has frequencies roughly equal 
in all groups, except that among the Sudanese Ss, 
most of whom were boys, preparation for the future 
on the part of girls received no mention. 

Play seems to be the outstanding function of 
American children. It is said to be the purpose of a 
boy in 34% of the instances and of the girl in 27% 
of the cases. The corresponding Lebanese fre- 
quencies are 9% and 5% ($ of A-L for boys and 
girls each < 001). This finding, too, is congruent 
with other data to be presented in subsequent 


reports. 


No mention of play occurred in the Sudanese 


answers to these questions. No supporting data 
are available, but the rarity of Sudanese references 
to play strongly suggests either that play is zelas 
tively infrequent, or that it is disapproved of, or 
both, among this group. 


Trees 

of the Lebanese Ss state that 
od. The high proportion of 
anese is no doubt 
has a great variety 


Sixty-nine per cent 
trees are to provide fo 
such answers among the Leb 
related to the fact that Lebanon 


of food-bearing trees, including olives, almonds, 
hazel nuts, walnuts, dates, oranges, lemons, grape- 
fruit, bananas, cherries, peaches, plums, and 
apples. Even pine trees are prized for edible nuts. 
But American children, living in the same environ- 
ment, mention the food-producing function of 
trees only half as often (34%) (A-L $ < .001). 
This finding demonstrates again that answers are 
derived from culture rather than from the physical 
environment. 


The shade-producing function of trees is men- 
tioned more often by the Americans and the Suda- 
nese (10% and 12%) than by the Lebanese (2%) 
(p of S-L = 01, p of A-L < .001). Climbing trees, 
which is a play function, has a frequency of 12% 
among the Americans, 1% among. the Lebanese, 
and zero among the Sudanese. Again, since Amer- 
ican and Lebanese Ss were all living in Beirut, it 
can be assumed that the differences were not pro- 
duced by differences in natural environment. 


Wood 


Wood as a fuel is frequently mentioned by the 
Americans and the Lebanese, but only seldom by 
the children of Sudan where heated buildings are 
unknown. Wood as a material used in building is 
much more familiar to Americans than to the other 
two groups. No Lebanese or Sudanese homes are 
built of wood, this scarce material being reserved 
chiefly for doors and window frames. Sudanese 
homes are frequently surrounded by an adobe 
wall, entrance to which is by a wooden gate. It 
will be noted that wood used for gates 1s referred 
to only by the children of the Sudan. 


Cats and Dogs 

In the Near East and in certain other parts of 
the world cats and dogs are not pets as they are 
in Europe and America. In the Near East they are 
treated in much the same way as are other domestic 
animals whereas in the Western world they have 
a special status. The Lebanese and Sudanese 
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answers reflect the fact that if a dog is kept it is 
for the purpose of guarding the house and warning 
against intruders. A dog is occasionally used for 
hunting. In consonance with these adult notions, 
guarding and hunting make up 62-66% of the 
Lebanese and Sudanese responses. Among the 
Lebanese, to play with a dog and to pet a dog are 
referred to only by the younger children, among 
whom such answers occur in 4% of the cases. Atti- 
tudes towards dogs do not differ as between the 
Lebanese Moslem and Christian groups (see 
Table 2). 2 

In contrast, 49% of the American answers 
indicate that dogs are pets and are to be played 
with (A-L p < .001). Among the Americans such 
answers are as frequent among the older children. 
as among the younger. 

The differences among the groups in regard to 
cats is similar to that prevailing in reference to 
dogs. For the Lebanese and Sudanese the primary 
feline purpose is the catching of mice and rats. 
Occasionally the catching of insects by cats is 
mentioned. When specific insects are mentioned, 
they are usually cockroaches. Reference to cats 
as animals to play with or to pet occurs infre- 
quently in these groups (6-7%) but 52% of the 
American answers fall into this class (p < .001). 


Birds 


For some children in all groups, birds are to 
fly, to sing, and to be put in a cage. But 37% of 
the Lebanese Ss state that birds are to be killed 
and eaten. This reflects the Lebanese custom of 
shooting and eating many birds which Americans 
do not classify as game birds. The reader may note 
again the prevalence of responses referring to food 
on the part of the Lebanese. The Sudanese mention 
birds as food less frequently (16%) than the Leba- 


nese, but more frequently than the Americans 
(1%) (A-L p < 001). 


Stones 


In all groups stones are primarily to build 
with, 38%, 73% and 53% of the answers, respec- 
tively, being in this category. This answer is not 
surprising in countries in which stone has been the 


most widely used building material for thousands 
of years. 


“Stone” was included in 
part because stones can be used aggressively, In 
this regard only small differences appear among 
the geoups. Using stones for throwing or hitting 
makes up only 13% of the Lebanese, 19% of the 
American, and 2% of the Sudanese responses. That 
the frequency of this answer is low among the 

Sudanese seems to be due to the greater age of the 
Sudanese subjects. Age is a major variable affecting 
aggressive verbal responses in regard to stones 


the stimulus list in 
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(see Table 2). Among the Lebanese responses, 
throwing and hitting make up 25% of the re- 
sponses of the 5-6 year olds to the word “stone.” 
Beyond this age such responses become infrequent 
in both Lebanese and American groups, apparently 
indicating strong disapproval of overt aggression. 


The Sun 


As might be expected the sun’s predominant 
functions are to provide warmth and light. On this 
item one of the major group contrasts lies in the 
fact that the Sudanese Ss referred to the heat- 
providing function of the sun much less often than 
the other groups (9% as contrasted to 25-28%) 
(A-S p < .001). Apparently, the Sudan is ordi- 
narily so hot that the child thinks the sun’s contri- 
bution is not useful! 

Another service provided by the sun is that of 
drying clothes. This is mentioned by 9% of the 
Lebanese, 2% of the Sudanese, and none of the 
Americans. Presumably, more clothes are dried at 
home in the former groups. Table 2 indicates that 
among the Lebanese this answer is more prominent 
among the very young (10%) than among the 
oldest group (5%) and more frequent among the 


poorer than among the more well-to-do (12% 
and 6%). 


Sand 


Because of very extensive building operations 
which are under way in Beirut, construction work 
is familiar to all Beirut children. The use of sand 
in building is mentioned by 58% of the Lebanese. 
American children living in Beirut mention con- 
struction much less often (21%) (p < .001). 

One of the major group differences in regard to 
sand involves play. Playing in sand is mentioned 
by 41% of the Americans, 18% of the Lebanese 
(A-L p < .001), and 2% of the Sudanese. The 
generally high rating of the Americans on play 
responses has been noted in connection with several 
earlier questions. The low play rating of the Suda- 
nese Ss on this particular question is in part due 
to their ages, since sand play is uncommon at ages 
9 and 10. Among the Lebanese, play as a use for 
sand has a frequency of 30% at ages 5-6 and of 
only 7% at ages 9-10. It will be noted that the 


frequency for American children for a 
e es 5- 
combined is higher than the apan 


frequency at 
5-6 among the Lebanese, $ "ge 
Rain 


Rain provided no 
interest. In all groups, 
buted to rain is that of 
This response is highes 
other two groups gav 
fell into the miscellan. 


group differences of special 
the most common use attri- 
aiding the growth of plants. 
t among the Americans. The 
e many more answers which 
eous or unclassified category. 
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- the age levels represented 
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Gold 


Almost all answers concerning gold fall into two 
classes: Those referring to decorative uses such as 
the making of rings, bracelets and earrings, and 
those referring to economic uses, such as state- 
ments to the effect that gold is to sell, or to buy 
things, or to make money, or to make one rich. i 

The Sudanese frequently mention decorative 
uses and seldom mention economic ones. Americans 
give a reverse picture, economic responses pre- 
dominating. The Lebanese have intermediate re- 
sponse frequencies. These differences are probably 
related to variations in the use of gold ornaments 
among the three groups, and to variations in 
acquaintance with the monetary value of gold. 

In Lebanon, chocolate candies are frequently 
wrapped in gold foil. As a result, among the 
younger Lebanese children a not infrequent answer 
to “What is gold for?” is “To wrap candy in.” 


Comparisons of Lebanese Subgroups 


In the foregoing comparisons among the three 
main groups, reference has been made to Leba- 
nese subgroups only when it clarified the main 
group contrasts. Let us now examine in more 
detail the data contained in Table 2. 


While Lebanon is noted for its cultural diversity 
it appears from Table 2 that the children of Mos- 
lems, Armenians, and Arab Christians have much 
in common. The first three columns of Table 2 
present data for these groups. In most instances, 
the percentages for Moslems and Arab Christians, 
who have lived side by side for centuries, are al- 
most identical. Since the majority of Armenians in 
Lebanon have immigrated to this country since 
the Turkish massacres of 1914, they have had a 
relatively short time in which to become accul- 
turated. Their responses differ somewhat from 
those of the other Lebanese groups. But they di- 
verge much less from the Moslems and Arab Chris- 
tians than do the Americans and the Sudanese. 

So far as the answers given by the two sexes are 
there is a general similarity at 
in this study. Only a few 
appear. In regard to the 
more than do boys (38% 
s more often 


concerned, again 


significant sex differences 
sun, girls stress warmth € PE 
vs. 23 < 001). Playing in san 
a by the girls (21% vs. 16% p < .001). 
The largest difference appears 1 regard to gold. 
Decorative uses are given by 72% of the girls and 
only 39% of the boys ($ < 001). 

Age affects the frequency of responses to nearly 
every question. Thus, eating as a function of the 
hands decreases with age, and writing increases. 
Play in general decreases, whereas work increases. 
The reader with a special interest in age differences 
will wish to examine Table 2 in some detail. It will 


not be feasible to comment on age changes in re- 
spect to each item. Concepts of use strongly reflect 
developmental age. This fact underlines the im- 
portance of equating for age any groups which are 
to be compared. The reader should bear in mind 
that the American and the Lebanese groups were 
equated for age composition. The fact that the 
Sudanese group consisted entirely of 9- and 10- 
year olds has been taken into account in making 


all comparisons. 
Hypotheses Concerning the Lebanese Child 


The data have been seen to reveal a consider- 
able number of differences among the three 
major groups of Ss. It remains to attempt to 
characterize in a general way the answers of 
each group. Such a general characterization is 
warranted only for the Lebanese and the 
American groups in view of the small size of 
the Sudanese sample and the fact that it repre- 
sented only one age level. 

It has already been observed that the Leb- 
anese answers seem to show a considerable 
concern with food. Common answers include 
the following: mother feeds us, father provides 
food, the mouth and the hands are used for 
eating, trees produce food, and birds are to be 
eaten. The answers taken by themselves do not 
show why food is so important. There is no 
reason to believe that the Ss suffered from mal- 
nutrition. Actually, the majority of Lebanese 
children appear to be very well fed. Observa- 
tions other than those presented here support 
the tentative suggestion that food preparation 
and eating have for the Lebanese a pleasure 
function that is comparable to the positive 
valuation given by some societies to other ac- 
tivities such as music, conversation, or visual 
esthetic experiences. 

In the second place, the Lebanese responses 
seem to indicate that industriousness is highly 
valued. Note the following common answers: 
father and mother work, girls and boys work, 
the hands are for work, dogs work by guarding 
the house, cats work by catching mice and 
rats. Trees and birds provide food, which may 
be thought of as a work function. Stones and 
sand supply building materials. Much of the 
world seems to be oriented toward utilitarian 
ends. 

Concern for language facility and scholastic 
performances seems evident, particularly 
among the private school pupils. The mouth is 
for talking, the hands for writing, and one of 
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the chief functions of both boys and girls is to 
go toschool. ; 
Few other emphases are observable in the 
Lebanese answers. There is some mention of 
play in connection with questions about boys 
and girls, and the question about sand, but 
little or no mention of it in reference to parents, 
hands and mouth, oranimals. There is no men- 
tion of art, of music, of athletics, or of sensory 
pleasures other than eating. There is little indi- 
cation of valuation of emotional life or of group 
cohesion. There are no references to competi- 
tiveness in areas other than school achieve- 
ment. There is nothing to suggest that religious 
observances or religious affiliations are highly 
significant. 


Hypotheses Concerning the American Child 


The present data, it is important to remem- 
ber, are restricted to American children in 
Lebanon, and more specifically, to those at- 
tending a specific progressive school. The 
parents are nearly all college graduates, and 
most seem to have accepted the doctrine of per- 
missiveness. While many groups of children 
living in America probably would answer as do 
the American group in Lebanon, not all groups 
of children in America would do so. 

The Beirut colony of American children (at 
ages 5-10) may be characterized, from their 
answers, as being greatly concerned with play. 
To a much greater extent than among the 
Lebanese Ss, the hands are for play, boys and 
girls have play as their purpose, trees are for 
shade and climbing, dogs and cats serve as pets 
and playmates, birds are to be enjoyed, and 
sand serves a play function. In contrast to the 
seriousness of the Lebanese children, life for the 
American children in Beirut would seem to 
constitute a perpetual vacation. 

Positive mention of play is associated with 
low frequencies of reference to work and to 
duties. Going to school, and helping parents 
are reported as functions of boys and girls 

much less often by Americans than by the 
Lebanese. The function of the father as a 
bread-winner is less frequently cited by the 
American group. As noted Previously, dogs and 
cats are for fun and are seldom thought of as 
utilitarian animals. Similarly, the use of stones 
and sand as building materials is stressed much 
less by the American group. Apparently the 
American child sees the world more in terms of 
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pleasure and less in terms of duty and work 
than does the Lebanese child. | 
This difference appears not to be due to any 
appreciable extent to differences in social class. | 
If the American group is compared with the 
Lebanese private school group (Table 2) rather 
than to the entire Lebanese sample the differ- 
ences remain substantially unchanged. It is 
also true that Americans diverge as much from 
Lebanese Christians as from Lebanese Mos- 
lems, these two groups scoring very similar per- 
centages in the response categories to every 
question (see Table 2). | 
The categories listed in Table 1 include only | 
the most common answers. The responses that 
are not represented in this Table were given by 
only a few children. They may therefore rea- 
sonably be called individual or unconventional 
answers, and may represent individuality or 
originality. If one adds for each question the 
percentages of Americans and Lebanese giving 
answers in the categories of Table 1, one finds 
that to fourteen of the sixteen questions the 
Lebanese gave a larger proportion of common, 
orstereotyped answers. The American answers, 
in general, show more variety. This finding may 
represent the result of permissiveness, it may 
point to more varied experiences, or it may be 
the result of direct encouragement of individual 4 
interests and modes of thought. This difference 
seems to the investigator to be among the more 
interesting ones revealed by the study. ° 


SUMMARY 


A technique for the study of children’s con- 
cepts of the uses of common objects has been 
presented which is believed to þe universally 
applicable. Its use has been illustrated with 
American and Lebanese groups and to a lesser 
extent with Sudanese Ss. The study has demon- 
strated that in regard to universally present 
objects, such as hand: 
cats, sun, sand and stone, children of different 
cultures differ in their 
years of age. It has also 
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the connection between measured author- 
_ ilarian attitudes and behaviors referred 
to in the literature as rigidity and intolerance 
of ambiguity, the existence of a true covariance 
is still in doubt and the antecedent conditions 
necessary for such a covariance remain unclear. 
Early research by Rokeach (9) and Frenkel- 
Brunswik (1, 7) obtained positive relationships 
between California E- and F-scale scores and 
intellectual and perceptual measures of rigidity 
and intolerance of ambiguity. Numerous sub- 
sequent studies have failed to substantiate 
their findings. The view that an atmosphere 
of ego-involvement or personal stress may be 
a necessary condition for obtaining such rela- 
tionships has been recently advanced (3). 
Unfortunately, experiments designed to test 
this view have merely contributed additional 
conflicting evidence (5, 6). The problem has 
been complicated further by suggestions that 
induced ego-stress in itself is sufficient to 
produce rigid and ambiguity-intolerant be- 
havior in experimental subjects (Ss), inde- 
pendently of supposedly predisposing person- 
ality traits. Research regarding this point has 
likewise been equivocal (3, 4, 6, 10). 

The number and complexity of contradictory 
experiments in this area make it impossible to 
distinguish common factors differentiating 
studies that obtain a positive relationship be- 
tween the California scales and rigidity and 
ambiguity-intolerance from those in which 
such a relationship has not been found. Some 
of the difficulty can . attributed to differ- 
ences in experimental procedures used to 
‘nduce ego-stress and perhaps even more from 

failure to evaluate methods for operationally 

fining rigidity and intolerance of ambiguity. 
.go-involvement has usually been adminis- 


I SPITE of extensive research dealing with 


+? This paper is adapted from a dissertation submitted 
in candidacy for the Ph.D. degree at the University of 
Connecticut. The writer is indebted to Maurice L. 
“arber, thesis advisor, and W. L. Jenkins for assistance 
the present formulation. A report of this study was 
“en at the APA meeting, 1954. 
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tered to large groups with no attempt made to 
evaluate objectively the effectiveness of ex- 
perimental instructions. Tasks selected to 
measure rigidity and intolerance of ambiguity 
have been many, most having little more in 
common than the name. Few of the methods 
have experimentally differentiated the be- 
havioral effects of ambiguity-intolerance from 
those of rigidity, and the majority appear not 
to measure variables relevant to the California 
theory of authoritarianism (1). 

The present study was designed to introduce 
some clarity by: (a) administering ego and 
task orientations individually, and running 
postexperimental checks on their effectiveness, 
and (b) operationally defining and distinguish- 
ing between intolerance of ambiguity and 
rigidity in a manner appropriate to theoreti- 
cally posited authoritarian characteristics. 

In terms of the experimental tasks con- 
structed for the present research, intolerance of 
ambiguity refers to the tendency to respond 
to previously unexperienced and unstructured 
stimuli with a subjectively structured response 
set. Rigidity refers to the tendency to resist 
modifying an established response set when 
the stimulus conditions under which it was 
established have been changed. To assure that 
resistance to change is attributable to person- 
ality factors and not to initial differences in 
the habit strength of sets, experimental control 
of set strength among Ss is necessary. This 
control is also essential in order to differentiate 
the behavioral effects of ambiguity-intolerance 
and rigidity. For example, if Ss in an unstruc- 
tured stimulus situation are given an equal 
number of trials to establish a response set, 
Ss high in ambiguity-intolerance may be ex- 
pected to establish a stronger set than Ss low 
in ambiguity-intolerance because of their 
tendency to acquire it earlier and apply it 
more consistently. Under these conditions, it 
is psychologically meaningful to speak of 
rigidity, as gauged by resistance to extinction, 
only when differences in set strength among S's 
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are equalized. If rigidity is to serve as a moti- 
vational concept, it must contribute to resist- 
ance to extinction above and beyond simple 
habit strength. Unfortunately, many of the 
operational measures of rigidity currently in 
vogue fail to assess and control for differences 
in initial set strength. : i 
In lightof these considerations, the following 
questions were posed for experimental anal- 
ysis: 1. Do authoritarians subjectively estab- 
lish a set or norm in a previously unexperienced 
and unstructured situation more rapidly than 
equalitarians? 2. After establishing an equally 
sirong response sel, are authoritarians more 
resistant to modifying their set than equali- 
tarians when stimulus conditions are changed? 
The experiment was further designed to 
answer these questions: 3. Does induced ego- 
involvement increase relationships among 
measures of authoritarianism, ambiguity- 
intolerance, and rigidity? 4. Does ego-involve- 
ment in itself produce greater rigidity and 
intolerance of ambiguity than nonego-involve- 
ment? 


METHOD 


The Ss whose scores on the standard 30-item Form 
40-45 California F scale (1) were known, were asked to 
participate in a one-session laboratory experiment in- 
volving two series of judgments of autokinetic move- 
ment. The autokinetic phenomenon satisfied the 
requirements of unstructuredness and freedom from 
related prior experiences. The fact that positive results 
emerged from the only previous study relating the 
California scales to autokinetic judgments (2) gave 
further evidence of its appropriateness as a research 
instrument. 

Series 1 consisted of judgments of the direction and 
distance of movement of a one-light autokinetic stimu- 
lus and was designed to measure the speed of norm or 
set formation; scores were represented by the number 
of stimulus presentations necessary before S attained 
a standard experimental criterion of establishing a 
response set. The criterion required that 10 out of 13 
consecutive judgments be reported as moving in only 
one of four primary directions (up, down, left right) 
with a magnitude varying not more than one aan from 

one another. On the basis of Previous research the 
assumption is warranted that barring significant 
changes in the stimulus field, Ss would retain this re- 
sponse set as a frame of reference for subsequent judg- 
ments. Series 1 was ended for each S whenever a se- 
quence of his judgments met the criterion. Although 
the number of trials to reach the criterion varied among 
Ss, the same degree of consistency in judgments was 
required of all Ss as the basis for terminating Series 1. 
Set strength among Ss was thus operationally equated 
at the conclusion of Series 1, and since the second series 


followed immediately, Ss began Series 2 with equal 
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TABLE 1 
F-ScatE Groups SELECTED FOR EXPERIMENTATION 
Approximate Range of Grow 

Group N F-Scale F-Scale F -Scale 
Centiles Scores Mean 

Lio 14 0-10 1.6-2.9 2.52 
L20 15 11-20 3.0-3.1 3.05 
M 13 45-55 3.5-3.8 3.67 
Ho 14 80-89 4.34.5 4.35 
Hio 13 90-99 4.6-5.2 4.84 


habit strengths in their set for the one-light situation. 
During Series 2, a second light was added to the stimu- 
lus field in conjunction with the first light. The Ss were 
simply told at this point to continue judging the dis- 
tance moved by the first light. Preliminary experi- 
ments (8) had found that the simultaneous viewing of 
two lights typically curtailed movement from that per- 
ceived under one-light conditions. Series 2 consisted of 
50 judgments and was designed to determine the extent 
to which S shifted from his established one-light norm 
or set when viewing the two-light stimulus. 

Two operational measures were obtained in the 
autokinetic sequence: intolerance of ambiguity by the 
speed the norm was formed, and rigidity by the relative 
distance toward zero S shifted from his one-light norm 
during the two-light series. 

Approximately half of the Ss received instructions 
designed to create an ego-involving atmosphere, i.e., 
the impression that the tasks reflected their abilities 
and intelligence; the other half received instructions to 
create a task-involving orientation, i.e., the impression 
that the experiment was designed to try out apparatus 
for subsequent research on vision. A posttest question- 
naire was administered to check on the effectiveness of 
the induced orientations at the close of the laboratory 
session. Detailed descriptions of the procedure and 
questionnaire may be found elsewhere (8), 


Subjects 


The Ss were 69 students enrolled in an introductory 
psychology course. They were assigned at random to 
one of the two experimental orientations from five 
divisions of a California F-scale distribution of 162 
students. Table 1 presents the centile and F-scale 
score range for these divisions. Nine Ss failed to estab- 
lish a norm within the experimentally predetermined 
maximum of 150 Series 1 stimulus presentations, Norm 
formation scores for these Ss were arbitrarily set at the 
maximum of 150. Of the remaining 60 Ss who had met 
the criterion of norm formation prior to the 150th trial, 


two subsequently complained of illness and asked to be 
excused during Series 2. 


RESULTS 


; An answer to the question of whether author- 
Itarlans are more ambiguity-intolerant and 
rigid than nonauthoritarians can be gathered 
from Tables 2 and 3. The analysis of scores 
on trials to form a norm, contained in Table 
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TABLE 2 
ANALYSIS OF VARIANCE OF TRIALS TO FORM A 
Nors 
(Scores transformed to reciprocals) 


M 
Source df Sate F È 
Orientation 1 2.00 .61 | n.s. 
Authoritarianism 4 33.95 | 10.77 | .001 
Interaction 4 1.05 .32 | n.s. 
Within 59 3.30 
TABLE 3 
ANALYSIS OF VARIANCE OF PERCENTAGE OF SHIFT 
From Norm 
Mean 
Source df Square F $ 
Orientation 1 | 13,922.5 | 26.05 | .001 
Authoritarianism 4| 1,293.2 | 2.42] .06 
Interaction 4 657:9 | 1.23] ns. 
Within 48 534.3 


2, indicates a clear-cut relationship (F = 10.77, 
p < .001) between intolerance of ambiguity 
and authoritarianism. The relationship be- 
tween authoritarianism and percentage shift 
from norm, noted in Table 3, yielded an F of 
2.42 (p < .06). Although suggestive, this find- 
ing does not offer clear support for a corre- 
spondence between the California scale and 
rigidity. 

The second major question of the study was 
whether induced ego-involvement would in- 
crease relationships among the measures. 
Figure 1 illustrates the mean number of trials 
to norm formation for each of the five F-scale 
groups under both ego and task orientations. 
The relationship between rigidity and F-scale 
scores in terms of percentage of shift is por- 
trayed in Figure 2. It can be seen from this 
figure that the relationship between F-scale 
scores and rigidity is more marked in the ego- 
oriented group. Task-oriented authoritarians, 
for example, evidence little more rigidity than 
task-oriented equalitarians; the difference in 
the ego-oriented condition, in contrast, is 
quite pronounced. The interaction Fs of .32 
and 1.23 noted in Tables 2 and 3 indicate, 
however, that orientation factors have not 
been demonstrated to play a statistically sig- 
nificant role in obtaining the generally high 


relationships found between authoritarianism, , 


ambiguity-intolerance, and rigidity. 


The effect of induced ego-involvement in 
producing increased ambiguity-intolerance and 
rigidity among experimental Ss is also illus- 
trated in Figures 1 and 2. Although the 9.8 
mean difference between ego- and task-oriented 
groups in trials to norm is suggestive, it is clear 
from Table 1 that the difference (F = .61) does 
not attain statistical significance. In contrast, 
a 21.1 per cent greater resistance to shift on 
the part of the ego-oriented group is signifi- 
cantly greater (F = 26.05) than that shown 
by the task-oriented Ss. This difference is 
independent of F-scale scores. Figure 2 clearly 
shows that all of the ego-oriented F-scale 
groups evidenced considerably more rigidity 
than comparable groups receiving the task 
orientation. 
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The introspective self-involvement _Tatings 
of the posttest questionnaire established a 
significant difference in the direction expected 
(F = 11.56) between ego- and task-oriented 
groups. The questionnaire indicated further, 
that task-oriented Ss showed an appreciable 
degree of concern regarding their personal per- 
formances in spite of the experimental in- 
structions used. No systematic difference was 
found among the F-scale groups in receptivity 
to the two testing orientations. 


Discussion 


The fact that authoritarians formed norms 
in the unstructured autokinetic situation more 
rapidly than nonauthoritarians under both ego- 
and task-orienting conditions clearly supports 
Frenkel-Brunswik’s position (1, 7) concerning 
the characteristic ambiguity-intolerance of 
authoritarians. The general finding that au- 
thoritarians modified their established norms 
to an appreciably lesser degree than non- 
authoritarians, offers additional support for 
the association between rigidity and authori- 
tarianism asserted by the California group. 
There appears, however, to be a limitation to 
this covariance. Although authoritarians mani- 
fested greater rigidity than nonauthoritarians 
in the ego-orienting atmosphere, no such dif- 
ference was evidenced in the task group. 
Though perhaps a favored mechanism it 
would appear that rigidity in the authoritar- 
ian is partially contingent upon variables asso- 
ciated with ego-involvement. What factors 
mediate this relationship is problematic. 

Although induced ego-involvement cannot 
be considered to have played a Statistically 
significant part in contributing to relationships 
found between authoritarianism, ambiguity- 
intolerance, and rigidity, it is of interest to 
note that had the experiment been conducted 
under task-orienting conditions only, the re- 


lationship between rigidity and the 
would have been 


(F = .462). In co 


F scale 
reported as not significant 
trast, had the experiment 
been run under €go-orienting conditions, the 
results would have been considered significant 
(F = 340; p < -03). Although minimal faith 
should be put in an hypothesis when the 
statistic called for by the experimental design 
turns up a nonsignificant effect, it seems wise, 
in light of the above facts, to reconsider the 
possible contribution of ego-involvement in 


the rigidity-authoritarian relationship. This 
caution appears particularly justified since the 
appreciable degree of ego-involvement found 
among our task-oriented Ss made it impossible 
to evaluate experimentally the full effects of 
the ego-involvement variable. Had the differ- 
ence between the two orientations been more 
marked, its contribution to authoritarian ri- 
gidity may have been more clearly established. 
In any event, further study of ego-involvement 
in this context would appear warranted. 

A major question underlying the present 
study was whether personal stress or ego-in- 
volvement would increase the general level of 
ambiguity-intolerant and rigid behavior among 
Ss regardless of their standing on the F scale. 
Although the influence is a dubious one as 
regards intolerance of ambiguity, there can be 
no doubt as to its effects on behavioral rigid- 
ity. Whether ego-involvement was in itself 
sufficient to bring about the increase cannot be 
readily determined. The following speculations 
regarding the connection may nevertheless be 
offered: Ss who were ego-involved with respect 
to their one-light norm were motivated to 
ward off stimuli that conflicted with or ques- 
tioned the adequacy of this norm. Because of 
its tendency to curtail the impression of move- 
ment, the second light had the effect of chal- 
lenging the S’s established one-light norm. 
Inasmuch as past judgments presumably re- 
flected intelligence and ability, Ss, in an 
attempt to safeguard self-esteem and reassure 
themselves of the adequacy of their one-light 
performance, warded off the visual influence 
of the second light. Task-oriented Ss, not 
feeling a personal identification with their 
performances in the one-light situation had 
little or no reason to retain or defend their 
norm. The thesis suggested then is that ego- 
involvement in the context of the laboratory 
tasks of this experiment served to motivate Ss 
to ignore or reject what was antithetical to 
their first-formed impressions. It would there- 
fore seem plausible to hypothesize that the 
effect of a norm-changing stimulus would vary 
with its acceptability to an individual and 
that the more ego-involved an individual is 
with respect to his norm, the more intense 
will be the operation of his def 
stimuli that challenge it. 


The above interpretation of rigidity as a 
defensive reaction to the threat posed by the 


enses against 


r 
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two-light stimulus adds an additional inter- 
pretive problem. Since authoritarians evi- 
denced considerably greater rigidity than 
nonauthoritarians in the ego-orienting condi- 
tion, the issue arises as to whether this differ- 
ence might be better attributed to greater ego- 
threat experienced by authoritarians than to 
greater rigidity manifested by them in response 
to equivalent degrees of threat. Because of its 
highly structured and limited nature, the 
posttest questionnaire offered no effective clues 
as to the validity of this hypothesis. If author- 
itarians were in fact more threatened than 
nonauthoritarians when the second light was 
presented, their greater resistance to change 
could be considered commensurate with the 
degree of threat they experienced. If this 
speculation is valid, authoritarians would be 
characterized better by their susceptibility to 
intense threat when their personal norms are 
questioned than by their “generalized mental 
rigidity.” 

Space does not justify a detailed review of 
how results of the present study agree or dis- 
agree with earlier research. Obviously, there 
are striking differences between experimental 
techniques used for measuring intolerance of 
ambiguity and rigidity. The fact that the 
present ego- and _ task-orienting instructions 
were given person-to-person rather than in a 
group setting, as is more usual, is a further 
obstacle to comparison. Until additional un- 
equivocal data are available to reconcile con- 
tradictory evidence from previous studies, it 
seems wisest to view new positive results 
concerning the relationship of rigidity and 
tolerance of ambiguity to authoritarianism 
with caution. Clearly, these behaviors are in- 
fluenced not only by personality factors, but 
also by the tasks utilized for measurement and 
the situational atmosphere within which the 


S must respond to them. 
SUMMARY 


Individual laboratory sessions involving 
judgments of autokinetic stimuli were con- 
ducted with 69 college students whose stand- 
ings on the California F scale were known; 
approximately half received ego-orlenting 1n- 
structions, half received a task orientation. 
Two behavioral scores were obtained during 
the experimental sequence: (a) the number of 
trials to form a norm while estimating direction 
and distance of a one-light autokinetic stimu- 
lus, as a measure of intolerance of ambiguity, 


and (b) the percentage of appropriate shift 
from the S’s established one-light norm upon 
the introduction of second light, as a measure 
of rigidity. 

Analyses of variance of these measures 
produced the following findings: 

1. (a) An F beyond the .01 level of confi- 
dence between authoritarianism and intoler- 
ance of ambiguity. 

(b) An F attaining the .06 level between 
authoritarianism and rigidity. The relation- 
ship was clearly significant in the ego-oriented 
group, but not in the task-oriented group. 

2. An F beyond the .01 level of confidence 
signifying greater rigidity in the ego- than the 
task-oriented group. 

3. Ego-involvement did not significantly 
increase relationships among the three experi- 
mental measures, nor did it produce a general 
increase in ambiguity-intolerance. 

The results of the study were felt to be 
consonant with positions taken by Frenkel- 
Brunswik regarding the typical rigid and 
ambiguity-intolerant behavior of authoritar- 
ians. Limitations to this relationship were 


noted and alternate hypotheses suggested. 
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monc the factors that have been inves- 

A tigated in studies of attitude change 

(e.g., 4, 5) are the personality of the 

person being influenced, and the source of the 

attitude change material. The present study 

was designed to test hypotheses in each of 
these general areas. 

According to the first hypothesis, attitude 
change is more likely to occur in “cognitively 
complex” persons than in “cognitively simple” 
persons. The concept of cognitive complexity- 
simplicity stems from the theoretical orienta- 
tion of Kelly (6), which assumes that each 
individual possesses a system of constructs for 
understanding and predicting his world. In 
comparison with a simple system, a complex 
system differentiates more sharply among 
ideas. It distinguishes among a greater number 
of different aspects of people or situations, 
employing more functional constructs and 
showing more variety in the relationships 
among them. 

A basic assumption in the present study is 
that the ideas expressed in a communication 
have meaning for its recipient and can there- 
fore be considered by him only if they are 
relevant to his existing personal constructs. 
The complex system, having more functional 
constructs, furnishes greater opportunity, 
statistically, for a new idea to be considered. 


Once considered, the new idea may be inte- 
grated into the person’s construct system to 
result in attitude change. In contrast, the 
cognitively simple system, equipped with fewer 
functional constructs, allows fewer opportuni- 
ties for new ideas to come in relation to a 
personal construct and to be integrated into it. 
New words may be manipulated, but unless 
the new idea is related to a relevant personal 
construct it can effect no change in the person’s 
construct system and, in a sense, has no mean- 
ing for him. 

This formulation resembles the Gestalt hy- 
pothesis that attitude changes normally occur 
because the “material to be judged is seen in a 
new light and has consequently changed its 
meaning” (7, P- 19). Both views emphasize 


the importance of the meaning of the communi- 
cation to the recipient. The present formula- 
tion sets up special conditions for “change in 
meaning” by stating that the new material 
must have relevance for an individual’s per- 
sonal constructs before it can be “seen in a 
new light.” 

The remaining hypotheses concern assimila- 
tive projection or assumed similarity (3, 8) as 
relating the source of attitude change material 
to the person being influenced. Assimilative 
projection refers to the degree to which a 
person perceives other persons (the communi- 
cation source in this case) as similar or dissim- 
ilar to himself. If a person perceives the source 
of a new idea as similar to himself, he is likely 
to see the idea also as similar, in some ways, tO 
his own ideas, and thus consider it “correct.” 
He should therefore be more susceptible to 
influence by a particular source group if he 
perceives that source group as similar to him- 
self. In the present study, two source groups 
are used—authority and peers. Hypothesis Il 
states: Persons who are susceptible to peer 
influence are more likely to perceive peers as 
similar to self than persons who (a) are nega- 
tive to peer influence, or (b) are susceptible to 
authority influence. Hypothesis III states: 
Persons who are susceptible to authority in- 
fluence are more likely to perceive authority 
figures as similar to self than persons who (a) 
are negative to authority influence, or (b) are 
susceptible to peer influence. 


METHOD 
Attitude Change Measure 


During another investigation by the authors (1) 
a 132-item attitude questionnaire was administered tO 
students (male and female) in two intermediate psy’ 
chology courses. Scattered throughout the question- 
naire were items from three modified Thurstone scales: 
attitudes toward the movies, tariff, and the punish- 
ment of criminals. The students were told the question- 
naire involved an investigation into the “psychology 
of normal people” concerned with their attitudes 0 
“every-day matters.” 

One month after the original testing session, four 
alternate forms of the second questionnaire were 
administered to the two classes. These questionnaires 
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were introduced with the following orally-delivered 
statement: 

Many investigations into the psychology of 
attitudes have dealt with issues generally given 
little thought or attention by people. We want to 
study attitudes that have been given some thought. 
In order to stimulate your thinking on these issues 
we have provided you with the opinions of other peo- 
ple. For each of the three attitude issues, first read 
the opinions expressed by these other people, 
then indicate your opinion on each item. 

Half of the questionnaires passed out to the classes 
Presented the opinions supposedly expressed by peers 
with this introduction: 

A national public opinion organization recently 
conducted a survey of the opinions of college stu- 
dents in the universities of the Big Ten... The 
survey showed a fairly high degree of consensus 
among the college students on several issues. On the 
following pages we have presented the attitudes of 
the great majority of college students. 

The remaining half of the questionnaires contained 
opinion statements supposedly expressed by generals. 
The students were told in these questionnaires: 

A writer for a national magazine recently inter- 
viewed Generals Bradley, Clark, and Taylor on 
certain issues of national interest. The generals were 
in complete agreement on these issues. On the follow- 
ing pages we have presented their attitudes. 

In contrast to the questionnaire administered in the 
first session, the items pertaining to a given issue, i.e., 
cither movies, tariffs, or the punishment of criminals, 
were presented together on a single page. Before each 
page containing the given attitude scale was a page 
describing the opinions of either the peers or the 
generals on this issue. To illustrate, the items in the 
attitudes-toward-movies scale were preceded in some 
of the booklets by a page containing this statement: 
“The great majority of the college students expressed 
favorable attitudes toward movies. They felt that 
movies often were very fine entertainment and fre- 
quently contributed to the education of the movie- 
goer.” For each attitude scale, half of the booklets 
contained favorable opinions by either the peers or 
generals while the remaining booklets contained un- 
favorable statements. The same opinion rationale was 
used for both the peers and generals. After reading the 
introductory statement to each issue, the subjects (Ss) 
indicated their opinions on the given attitude items by 


marking an IBM sheet. 


Cognitive Ci omplexity-Simplicity 


Cognitive complexity-simplicity was measured by 
the Role Conatct Repertory Test (RCRT), an 
instrument suggested by Kelly and used by Bieri (2) in 
his study on cognitive complextiy. The test consists of a 
grid across the top of which S writes the names of 22 
people personally known to him, who fit in prescribed 
role categories. The S is asked to consider three of these 
people at a time and to decide in what important way 
two of them are alike and different from the third. 
Different groups of three are considered successively by 
S until he has listed along the side of the grid a certain 
number of personal constructs derived from the original 
list of people. For each construct thus derived, the S 


then considers all the rest of the people in his list and 
places a check mark under the names of those people 
perceived as similar to the original two in terms of the 
construct. In this way the grid is formed with checks 
or voids which indicate the particular manner in which 
the S perceives similarities and differences within a 
group of people on the basis of his personal constructs. 
By comparing each construct row of check marks 
with each other construct row, one can determine their 
degree of similarity. If two rows have check marks under 
the same names, the constructs are functionally equiv- 
alent. When many of the construct rows for an S are 
alike, he is considered as cognitively simple. If he 
produces few rows that are identical or similar to any 
other row, he is regarded as having high cognitive 
complexity. The scores for cognitive complexity- 
simplicity were obtained by using the first 10 rows and 
first 10 columns in the standard RCRT.! Each row was 
matched with each other row. When a matching was 
identical, a score of three was given; when a matching 
was identical except for one, a score of two was given; 
and when except for two, a score of one. A high score 
indicates cognitive simplicity; a low score, cognitive 
complexity. : 


Assimilative Projection 

One of the persons which the S is instructed to use in 
the RCRT is himself. After the grid is completed, the 
column of checks and voids under the S’s name indicates 
how he perceives himself in regard to his own constructs. 
By comparing this self perception with his perception of 
other people as indicated by his test responses, his 
degree of perceived similarity, i.e., assimilative pro- 
jection, can be ascertained. To find assimilative pro- 
jection with peers, four figures were selected from the 
first ten on the grid to be matched with the self: (a) a 
brother, or brother-like person, (b) a sister, or sister-like 
person, (c) the present closest opposite-sex friend, and 
(d) the present closest same-sex friend. The assimilative 
projection score for peers was made up of the number 
of times a check or void under the self matched a check 
or void under each of the peers considered separately.? 

The assimilative score for authority figures was 
obtained the same way, using: (a) the mother figure, 
(b) the father figure, (c) the minister, priest, or rabbi 
with whom personal feelings about religion might be 
discussed, (d) the physician, (e) the teacher who in- 
fluenced S most when in the teens, (f) the S’s employer, 
supervisor, or officer during a period of great stress. In 
scaling the items, it was found that mother figure, 
father figure, and employer figure, individually, cor- 
related very little with the total score. In the subsequent 
analysis of data, therefore, only the religious person, the 
physician, and the teacher were used in determining the 
score on similarity to authority. The latter two were not 
among the figures included in determining the cognitive 
complexity score. 


1To facilitate scoring, the first ten rows and columns 
were used instead of the complete grid. 

2An analysis of the data revealed a slight negative 
relationship between assimilative projection on peers 
and cognitive complexity. Since both these variables 
are positively related to attitude change, this slight 
negative relationship indicates that the tests of the 
two hypotheses are independent of each other. 
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TABLE 1 
COGNITIVE COMPLEXITY-SIMPLICITY IN RELATION 
TO SOURCE OF INFLUENCE AND TYPE OF 
ATTITUDE CHANGE 


Mean Cognitive Complexity- 


Type of Attitude Simplicity* 
Change Generals Peers 
(N) assource (N) as source 
Little change (9) 16.44 (6) 35.50 
Variable change (5) 16.20 (9) 14.78 
Susceptible (7) 13.43 (9) 14.89 
Negative change (6) 8.83 


(12) 12.58 


Analysis of Variance 


Source of Variation DF MS F p 

Source of influence 1 200.16 = — 

Type of attitude 3 570.56 5.0 .05 
change 

Interaction 3 297.13 26 — 

Within groups 55 114.19 


* A high score represents greater simplicity. 


Analysis of Data 


Change scores from the first to the second testing 
were determined for each of the three scales. These raw 
scores were then converted into standard scores with a 
mean change score of 20 for each scale, and the three 
scale scores added to obtain a total standard suscepti- 
bility score. Since the mean change score over the three 
scales was 60, an S scoring higher than 60 was above the 
average in over-all susceptibility to change. Two groups 
of Ss were formed using only these total standard 
scores: (a) susceptible group, Ss having standard scores 
of 67 or more; (b) negative change group, Ss having total 
standard scores of 54 or less.* 

Ss whose total scores ranged close to the mean 
change were divided into two groups according to the 
variability of their reactions among the three scales. On 
the basis of a variability score, yielded by the sum of the 
differences between the S’s standard score on each scale 
and the corresponding mean standard score, the third 
and fourth groups were established as follows: (c) little 
change group, Ss having total standard scores of 55 to 
66 and variability scores of 7 or less; (d) variable change 
group, 5s having total standard scores of 55 to 66 and 
variability scores of 8 or more. The scores defining the 


3The standard scores were combined in order to 
obtain a measure of the individuals’ general reactions to 
the external communication. The Ss tended to be 
slightly influenced in the direction of the communica- 
tions. The raw score change means on the three at- 
titude scales were 0.0, 1.0, 2.0, and the SDs were 3.7, 
3.5, and 6.0, respectively. 

The susceptible Ss generally changed in the direction 
of the communication. Twelve of the 16 Ss changed 
toward the communication on all three scales; the 
other four were successfully influenced on two out of 
three scales. The negative change Ss tended to change in 
the direction opposite to that advocated by the com- 
munication. Sixteen of the 18 Ss changed away from 
the communication on two or three of the three scales. 


four groups were selected so that the Vs in the groups 
would be nearly equal. . 

To study the personality characteristics associated 
with the four kinds of reactions to the communication, 
the four attitude change groups were used in setting up 
analysis of variance designs with cognitive complexity- 
simplicity, perceived similarity to peers, and perceived 
similarity to authority as dependent variables in the 
analyses. If attitude change had been taken as the 
dependent variable for purposes of the analysis, these 
kinds of reactions to outside influence could not readily 
have been differentiated. An N of 63 in the complexity 
analysis was reduced to 61 in the assumed similarity 
measures because two Ss had not recorded any marks in 
the self column of the RCRT. Because of dispropor- 
tional cell frequencies, the analysis of variance pro- 
cedure suggested by Snedecor (9) was employed. 


TABLE 2 
ASSIMILATIVE PROJECTION ON PEERS IN RELATION 
TO SOURCE oF INFLUENCE AND TYPE OF 
ATTITUDE CHANGE 


Mean Assimilative Projection 


Type of Attitude on Peers 
Change Generals Peers 
(N) as source (N) as source 
Little change (9) 19.56 (6) 24.50 
Variable change (5) 20.80 (9) 21.89 
Susceptible o 20.50 (8) 23.75 
Negative change 6) 24.17 (12) 19.50 en 
Analysis of Variance 
Source of Variation DF MS F p 
Source of influence i 11.18 = — 
Type of attitude 3 5.21 — — 
change 
Interaction 3 67.28 42 05 
Within groups 53 16.34 


TABLE 3 

ASSIMILATIVE PROJECTION ON AUTHORITY FIGURES 
IN RELATION TO SOURCE oF INFLUENCE AND 

TYPE OF ATTITUDE CHANGE 


Mean Assimilative Projection 


Type of Attitude on Authority Figures 


Change 


Peers 
(N) as source 


Generals 
(N) as source 


Little change (9) 14.67 (6) 16.67 
Variable change (5) 13.60 (9) 15.67 
Susceptibility (6) 17.33 (8) 14.75 
Negative change (6) 12.67 (12) 12.92 


Analysis of Variance 


Source of Variation DF MS F p 

Source of influence 1 .08 = = 

Type of attitude 3 31.11 2.47 7 
change 

Interaction 3 1623 1.29 = 

Within 53 12.60 


Ps 


~— 
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RESULTS AND DISCUSSION 


Hypothesis I states that attitude change is 
less likely to occur in “cognitively simple” 
persons. Since the “little change” group con- 
tains persons who maintain their attitudes in 
the face of the outside influence, this group 
should have the greatest cognitive simplicity. 
The results of the analysis of variance given 
in Table 1 indicate that there are significant 
differences among the attitude change condi- 
tions when the communication source is not 
considered. The little change group has the 
highest cognitive simplicity means, tending to 
support the hypothesis. 

The mean cognitive simplicity in the “little 
change to peers” condition is significantly 
greater (p < .05) than the mean in the “little 
change to generals” condition, suggesting that 
the hypothesized relationship between little 
change in the attitude scale and cognitive 
simplicity is most pronounced when the source 
of the attitude influence is the Ss’ peer group. 
It may be that a “cognitively simple” person 
is not as resistant to change when the source 
is authority figures because of a need to con- 
form to these authority figures. Interestingly, 
the groups with the highest cognitive com- 
plexity are those negative in their reactions to 
both peer and authority persuasion. 

Hypothesis II states that persons who are 
susceptible to peer influence are more likely 
to perceive peers as similar to self than persons 
who are negative to peer influence or are sus- 
ceptible to authority influence. This hypothesis 
is supported by the significant interaction (p 
< .05) in the analysis of variance presented 
in Table 2. The Ss in the “susceptible to peers” 
condition perceive themselves more similar to 
peers than the Ss in the “negative to peers” 
condition (p < .05), while the direction of the 
differences between the susceptible and nega- 
tive conditions is reversed when generals are 
the source of influence. Although this latter 
difference does not attain significance, 1t may 
be that Ss who perceive themselves similar to 
peers react negatively to opinions advanced by 
generals. PE 

The difference in “assimilative projection 
on peers” between the Ss susceptible to peer 
influence and those susceptible to the generals 
is in the predicted direction, although it fails 
to reach significance. The high “mean assimi- 
lative projection on peers” in the “little change 
to peers” condition was not predicted. 


o 


Table 3 presents the mean scores for simi- 
larity to the authority figures in the eight 
groups. Analysis shows no significant effects, 
and Hypothesis III regarding similarity to 
authority is therefore not supported. Perhaps 
“authority” was considered too broadly to be 
psychologically meaningful. Reaction to au- 
thority may be different according to the kind 
of authority; for instance, conveyers of soci- 
ety’s values, charismatic leaders, or parental 
figures. 


SUMMARY 


Students in two undergraduate psychology 
courses were administered the same attitude 
scales on two occasions. The second adminis- 
tration was accompanied by information con- 
cerning the expressed opinions of either 
generals or peers. As hypothesized, those 
students showing little change in their attitude 
scale scores were found to be high on a measure 
of cognitive simplicity. Those students who 
were influenced by peers perceived themselves 
as more similar to peers than students who 
were negative to peer influence. An hypothesis 
relating assimilative projection on authority 
figures to influence by authority was not sup- 
ported by the data. 
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_A-PICTURE STORY (MAPS) TEST FINDINGS FOR 
oo RUBELLA DEAF CHILDREN! 


D. MARJORIE BINDON 
Canterbury University College, New Zealand 


LTHOUGH the Psychological Abstracts 
(down to 1955) do not report any 
personality studies done specifically on 
Tubella deaf children (children born deaf 
through the mother having German measles 
during pregnancy), many people closely con- 
nected with the deaf are of the opinion that 
these children behave differently from other 
defective hearing children. The writer, as a 
trained teacher of the deaf, has both taught 
and lived with hearing-impaired children and 
her experience has led her to query this belief. 
Tt is her view that rubella deaf children and 
nonrubella deaf children (those born deaf 
through causes other than maternal rubella or 
who had become deaf before the age of two) 
seemingly display similar personality charac- 
teristics but that, as a group, defective hearing 
children who have not naturally acquired 
speech and language do differ from the hearing 
children. 

To test this hypothesis the writer (2, 3) 
studied the personality characteristics of ru- 
bella deaf children by means of the Rorschach 
test using the Munroe checklist system of 
scoring (7). Her findings confirmed her hy- 
pothesis and revealed a common rigid person- 
ality pattern differentiating both the rubella 
deaf and the nonrubella deaf from the hearing 
group at well beyond the .001 level of confi- 
dence. 

This article reports findings with the MAPS 
test?, also used in the same investigation. The 
MAPS test can be adapted to avoid a fre- 
quent limitation of personality studies of deaf 


1Based on part of thesis in partial fulfillment of 
requirements for the degree of Master of Arts in Psy- 
chology, University of New Zealand. 

2MAPS test (10) isa variation of the TAT, but is 
less structured as figures are separate from the back- 
grounds. S selects and places figures on each background 
picture in turn and then proceeds to tell his story for 
that particular card. Test material comprises 22 back- 
ground pictures 814” X 11”, from which E normally 
selects 11, and a set of 67 figures identifiable by a code 
symbol from a figure identification card. Data derived 
from figure location sheet can be subjected to objective 
itative treatment whereas S’s stories can be 


t 
a ed in the manner of TAT protocols. 
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children that do not take fully into account 
the considerable language retardation of the 
deaf. When the examiner is a trained worker 
with hearing impaired children, and when the 
procedure is altered to eliminate the necessity 
for verbal response and the data are limited 
strictly to the scoring of the formal signs, the 
MAPS technique can seemingly be made 
entirely objective, quantitative, and if neces- 
sary, nonverbal in performance. 

Perhaps the most serious handicap of deaf- 
ness lies in the fact that it prevents the estab- 
lishment of normal interpersonal relationships 
(cf. 1, 4, 8, 9). The fantasy productions of 
deaf children may also be expected to reflect 
this deficit. The general hypothesis as previ- 
ously stated, therefore, leads to the following 
specific prediction to be tested here: MAPS 
records of rubella deaf and nonrubella deaf do 
not differ in the number of “normal” signs 
and “schizophrenic” signs; but the rubella 
deaf score fewer “normal” signs and more 
“schizophrenic” signs than their hearing 
counterparts. 


METHOD 
Subjects 


Rubella deaf group. This group was comprised of 36 
fifteen-year-old rubella deaf children (16 boys, 20 girls) 
who were born between December 1, 1938, and July 31, 
1939. When tested, they were attending coeducational 
government schools for the deaf in New Zealand which 
use entirely oral methods and admit both day and 
residential pupils. All the qualified teachers in these 
schools are trained at the main school for the deaf and 
follow similar educational procedures. The children 
studied? represent a rubella deaf population that lies 
within the extremes of intellectual functioning, since 
both the ineducable, or those with serious multiple 
handicaps, and the most intellectually able were not, at 
the age of 15, attending a school for the deaf. 

To permit investigating whether sex, degree of 
deafness, or the residential factor influence test per- 
formance differentially, this group was chosen tO 
include as many nonresidential pupils as possible, and 
an equal number of “partially deaf” and “totally deaf 


3The names of the children whose deafness was due 
to maternal rubella, or to other known causes before the 
age of two, were compiled from the original data 0 
D. R. Hay (5). 


fe 
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children who could be matched for age, sex, and educa- 
tional classification (above average, average, below 
average). “Partially deaf” and “totally deaf” refer 
here, respectively, to those children with average 
hearing losses >60 decibels but <101 decibels, and 
>100 decibels in the better ear, as tested by a pure- 
tone audiometer in the frequencies 512, 1024, 2048. The 
partially deaf children could use their residual hearing to 
varying extents with the help of individual hearing 
aids. Hence, they acquired language at a faster pace and, 
In general, were taught separately from the severely 
hearing impaired of similar age and intellectual ability. 

Nonrubella deaf group. This group was comprised of 15 
fifteen-year-old children (7 boys, 8 girls) who were 
either born deaf through causes other than maternal 
rubella or who had become deaf before they were two, 
and were all attending the government schools for the 
deaf in New Zealand. The writer is not prepared to 
say that these children are truly representative of a 
nonrubella deaf population; they were the only non- 
rubella deaf children available whose etiology had been 
clearly determined.‘ 

Nondeaf group. This group was comprised of thirty 
15-year-old children (13 boys, 17 girls) selected from 
children who volunteered to be tested and were attend- 
ing a coeducational, multicourse, public, nonresidential 
high school that was situated in the same city as the 
main school for the deaf. The school was selected to 
represent a hearing city school population that fell 
within the extremes of the very highest and lowest intel- 
lectual levels and whose members were reasonably well 
distributed throughout the various socioeconomic 
categories as gauged by parental occupation. A public 
day school was chosen rather than a private residential 
school or an orphanage, since the latter types of institu- 
tion not only do not sample a cross section of the school 
population, but are also far more organized, formal, and 
restrictive in their supervision of the out-of-school 
activities of their charges than are the government 
residential schools for the deaf, which are run on very 
progressive lines. 

Because of the limited number of deaf subjects 
available for testing, the method of matched pairs was 
used to investigate intergroup and deaf subgroup dif- 
ferences. Therefore, from the rubella deaf group of 36 
children, the 30 who were most closely matched for age, 
sex, educational classification, and approximate socio- 
economic status with a member from the volunteer 
normal hearing control group were selected for com- 
parisons with hearing children.® Similarly, but with the 
additional criterion of degree of hearing loss, 15 children 
from the rubella deaf group were paired with children 
in the nonrubella deaf group. Further, to investigate the 
possible influence of the residential factor on test 
performance, each of the nonresidential deaf pupils (7 
rubella deaf, 5 nonrubella deaf) was also matched for 


‘Ibid. 

5 The composition of the nondeaf g e 
age, sex, educational classification, and approximate 
socioeconomic status was directly comparable to that 
of the rubella deaf group. The writer had intended 
originally to match each rubella deaf child for the above 
mentioned variables with a nondeaf child, but she was 
forced to select her normal control group from volun- 
teers of whom only 30 fulfilled her matching criteria. 


roup in respect to 


sex, age, degree of deafness, educational classification 
and approximate socioeconomic status with an ap- 
propriate member from the residential deaf group. 

In conjunction with the Rorschach test and the 
MAPS test, the Wechsler-Bellevue Intelligence Scale 
Performance Form 11 was administered to all subjects 
(Ss) in this study. Statistical analysis of results by 
means of the / technique for matched pairs revealed no 
significant differences between the compared groups 
or deaf subgroups in either mean IQ or mean subtest 
scores, the pattern of which was essentially similar 
(p > .1 in all cases). It should be noted, however, that 
these mean scores for the rubella deaf group, as for the 
nonrubella deaf group, were all consistently lower than 
for the nondeaf group; that these findings were not 
significant may have been due to the small size of the 
samples. (Mean IQ of rubella deaf group was 91.6, and 
SD 12.2, as against mean IQ 101.7, and SD 13.6 for the 


nondeaf group.) 


Administration 

To overcome any bias the MAPS test may have 
against deaf children who are retarded in language 
development, the experimenter (E) altered the test 
procedure so that Ss could respond nonverbally to 
each stimulus situation. In the modification, Ss are not 
asked to relate a complete story about the figures they 
have selected to people each background picture. 
Content analysis in the manner of TAT interpretation 
is therefore precluded, and the scoring and analysis of 
the test findings is limited to the formal psychosocial 
sign categories that Schneidman (11) found to dif- 
ferentiate the fantasy productions of normals from 
psychotics®. The test was given individually to each S 
in a room occupied only by him and Æ. It came as the 
last of the three tests required of him. The specific 
procedure used for all Ss, nondeaf as well as deaf, was 
as follows: 

The E emptied the box containing the cutout figures 
onto the table at which S was seated. The S rearranged 
the figures face upwards. Æ sat slightly behind and to 
the left of S. From his particular position not only 
could E record responses without distracting S, but 
also a deaf S could speech read Æ with relative ease. 
The background pictures: Livingroom, Street Scene, 
Medical, Bathroom, Dream, Bridge, Bedroom, Blank, 
Doorway, Cemetery, and Forest, were presented in 
that order. Æ issued the following instructions orally: 

Have a good look at all the little pictures on the 

table. There is no hurry. Tell me when you are 

finished. . .. Now here is a big picture (first back- 

ground picture placed on table in front of S). I 

want you to make me an interesting story by choos- 

ing pictures from any of these here (indicating 


SShneidman (11) established a set of 59 objective 
signs (39 “normal” and 20 “schizophrenic”) that could 
statistically differentiate these two groups. Shneidman 
defines a MAPS test sign as “a denotable aspect of the 
subject’s performance, specifically having to do with 
his selection and handling of the test material.” 
(11, p. 189). It refers to a structural aspect of the 
performance as against the content of performance and 
relates to what is done with the figures rather than 


what is said about them. 
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figures on table) and putting them on this picture 

(pointing to background picture). You can use as 

many of these pictures (pointing to figures) as you 

i ff you go. , 

Pa a ee oe figure on the background picture 
he was told, for the first card only, that he could use 
more than one figure if he wished. This instruction was 
given in case a deaf S had misunderstood the opni 
test instructions. Æ then told S Yes, that’s right. 
Now tell me about the story.” (E was interested only in 
gaining information to help her to record the theme on 
decide whether figure interaction had taken place an 
was not concerned with securing content material for a 
possible TAT-like analysis: verbal responses were not 

essential for securing this information which could as 
readily have been obtained pantomime fashion.) 

Well, that was a good story. I have some more 
pictures here (indicating pile of background pictures) 
for you to tell me a story about. You can use the 
same little pictures here (pointing to figures on 
first background picture) again, or you can use any 
of the others (indicating remaining figures with a 


ep of the hand). 
sw The S then returned the figures he used to the 
table, the first background picture was withdrawn 
and another put in its place. All the background 
pictures were dealt with in the same manner. 
Schneidman was followed explicitly in the formal 
scoring of the figure location sheets. 


RESULTS AND DISCUSSION 
Quantitative Findings 


To avoid capitalizing on chance differences 
(cf. 6, p. 245), it was decided not to compare 
groups on their total individual sign scores 
unless their composite sign score differences 
yielded a significant xp < .01 (x? corrected 
for continuity, df = 1, cutting point lower 
median score). 

Both MAPS composite sign scores, “normal” 
and “schizophrenic,” failed to differentiate the 
nonrubella deaf group from its matched rubella 
deaf group; they also failed to differentiate the 
matched deaf subgroups formed on the basis 
of degree of deafness, sex, and residential 
factor, respectively. (The differences in all 
cases were not large enough to warrant statis- 
tical analysis.) These results are consistent 
with the first half of the hypothesis under 
investigation; namely, that rubella deaf chil- 
dren do not differ significantly from nonrubella 
deaf children in number of “normal” signs and 
“schizophrenic” signs on the MAPS test, 
Moreover, neither sex, degree of deafness, nor 
the residential factor appeared to have had 
any differential influence on the fantasy pro- 

ductions of the deaf population tested as meas- 
ured by these composite sign scores. 
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On the other hand, in confirmation of the 
latter half of the specific prediction, the MAPS 
test results show that the rubella deaf sample 
gave significantly fewer “normal” signs (p 
< .001) and significantly more “schizophrenic” 
signs (p < .01) than did its matched hearing 
group (Table 1). Because these over-all results 
were significant, the scores for these groups 
were examined for differences in the separate 
sign categories. Out of a total of 59 sign com- 
parisons, 6 “normal” signs and 3 “schizo- 
phrenic” signs were found to differentiate the 
above groups (p < .05—Table 1). 

The deaf (rubella and nonrubella) and non- 
deaf Ss were then compared on these particular 
9 signs and on their composite sign scores, 
the last child in the rubella deaf sample of 36 
being arbitrarily dropped so as to facilitate 
statistical calculations. This step was taken 
in order to test the general hypothesis that, as 
a group, defective hearing children who have 
not naturally acquired speech and language 
behave differently from the hearing. The sta- 
tistical findings confirmed this hypothesis. ‘The 
sign category scores on which the deaf and 
hearing subjects were compared were, without 
exception, found to differentiate them at be- 
yond the .05 level of confidence; both com- 
posite sign scores and all individual sign 
categories, except for FZ (figure interaction) 
with spouse and the presence of M-8 (priest) 
in any background (both dropped from p < 
02 to p < .05) had chi squares that were 
larger than those calculated for the nondeaf 
and the paired rubella deaf group (Table 1). 


Qualitative Findings 


Only qualitative information that supple- 
ments the quantitatively established differ- 
ences will be discussed here. Since the quali- 
tative performances characteristic of the 
rubella deaf are also characteristic of the 
nonrubella deaf, the discussion in this sec- 
tion refers to the deaf as a group. 

The outstanding fact to emerge from com- 
parison of deaf and nondeaf on the “normal” 
sign scores was that the group difference in 
each figure interaction category was highest 
when the interaction represented the most 
personal and frequent interrelationships of 
normal everyday life, and was lowest for the 
least personal and least frequent. Thus, FI 
with child by adult gave p. <001; FI with 


5 
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TABL. 


E1 


COMPARISON OF MATCHED RUBELLA DEAF AND NONDEAF, AND OF DEAF AND NONDEAF GROUPS WITH RESPECT 
TO CERTAIN MAPS Sicns* 


Non- 
deaf 


Rubella 


Category Deaf 


(N = 30) (N = 30) 


p (N= daf xt b 


x 1 a 
50) (N = 30) 


“Normal” Signs 


“Normal” composite sign score 12 28 16.86 <.001 22 28 17.42 <.001 
Child figure three or more times 2 9 4.01 <.05 5 9 14.9 <.001 
Figure interaction with acquaintance 19 27 4.57 <.05 31 27 6.04 <.02 
Figure interaction with spouse 12 22 5.50 <.02 22 22 5.39 <.05 
Figure interaction with parent 5 23 17.87 <.001 7T 23 28.5 <.001 
Figure interaction with child by adult 3 21 20.07 <.001 6 21 25.75 <.001 
In Medical, man takes an exam 1 ll 8.44 <.01 1 11 15.06 <.001 
“Schizophrenic” Signs 
“Schizophrenic” composite sign score 22% 11 6.73 <.01 38 11 9.98 <.01 
Feet off ground 17 5 8.68 <.01 32 5 15.48 <.001 
In any background—priest, M-8 19 9 5.42 <.02 28 9 4.11 <.05 
Ignoring background 18 1 19.72 <.001 30 1 23.04 <.001 


* In testing “normal” and ‘schizophrenic 
and nondeaf groups was used as cutting point (9 and 1, respectively). 


parent p < .001, FZ with acquaintance p < 
.02, FI with spouse p < .05; in comparison 
with the remaining FI categories where no 
statistically significant differences existed: FI 
with stranger and FZ with animal gave p > Al 
in both cases. The lack of themes involving 
figure interaction on the part of the hearing- 
impaired was not so much due to their giving 
more stories to backgrounds that contained 
only one figure, but to their tendency to popu- 
late the backgrounds with figures each going 
his own way or, apparently, representing 
separate stories. For instance, a street scene 
of six figures with no interaction between any 
of them may be given: or, in the dream, four 
figures may be placed, each of whom represents 
a different dream that the sleeper had had. 
Group differences in the selection of the 
same child figure three or more times (p < 
(001) may indicate that the defective hearing 
group show less constancy in their identifica- 
tion with a child figure. This indication of a 
lack of constancy in identification seems to be 
supported by the investigator’s observation 
that the individual deaf Ss, more than the 
hearing, tended to use a greater variety of 
figures to represent the same type of character 
(e.g., a father or mother); in doing this they 
tended, more than did the nondeaf, to include 
in their selection of human figures, figures out- 
side the male adult, female adult, and child 
categories (e.g., minority category: Negroes, 


” composite sign scores for significant differences, lower median score for matched rubella 


Jews, etc., silhouette and blank faces category; 
and undetermined as to sex category). 

Although the nondeaf sample gave a higher 
proportion of figure relationships involving 
verbal interaction, the nature of the interre- 
lationships was, in general, very similar for 
both groups: Physical aggression (mainly 
shooting or fighting) predominated, and the 
most common feelings expressed were anger, 
fear, and to a lesser extent, sorrow, while the 
emotions of love, joy, or happiness were con- 
spicuously absent. 

It has already been mentioned that an out- 
standing characteristic of the fantasy produc- 
tions of the deaf is their lack of themes involv- 
ing interpersonal relationships; another 
outstanding feature is their element of incon- 
gruity due, apparently, to a lack of reality 
testing (cf. “schizophrenic” signs: ignoring 
background, p < .001; feet off ground, p < 
.001). Information from the figure location 
sheets supplied other types of inappropriate- 
ness given by the deaf but not by the hearing 
group. For example, specific figures were 
selected for specific backgrounds for which 
they are not normally selected: F-1 [nude 
female figure] in the Doorway, L-4 [ghost] in 
bath in the Bathroom; incongruous themes 
were given: “This man [M-6—man with a 
gun] shot that man [M-7—supine blood-stained 
male] and policeman [-5—policeman stand- 
ing feet astride, hands on hips] saw”; the 
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placing of M-7 on the operating table in the 
Medical and a male figure beside it, with the 
theme being that M-7 was dead and the other 
figure was the doctor. It is of interest to note 
that these two themes, or very similar varia- 
tions of them, were given by several deaf 
children as against none of the hearing, and 
it is tentatively suggested that they exemplify 
not only incongruity, but also the assigning 
of ineffectual roles to people in authority. 

The more frequent use that the deaf made 
of the schizophrenic sign M-8 (priest, used 
almost exclusively in the Cemetery back- 
ground) can probably be accounted for in 
terms of an association of ideas and does not 
in their case indicate a heightened religiosity 
(cf. Shneidman, 11). 


SUMMARY 


With procedure altered slightly to eliminate 
the deaf subject? language handicap, the 
MAPS test was given to three groups of 15- 
year-old children: 36 rubella deaf, 15 nonru- 
bella deaf, and 30 hearing children. For 
purposes of intergroup matched pair compari- 
sons, members of the nonrubella deaf group 
and nondeaf group, respectively, were matched 
with individuals selected from the rubella deaf 
group; the groups to be compared being then 
(a) rubella and nonrubella deaf (15 pairs) and 
(b) rubella deaf and nondeaf (30 pairs) . 

Results (quantitative and qualitative) for 
both deaf groups were similar and when 
matched deaf subgroups were compared, nei- 
ther degree of deafness, sex, nor institutional 
residence was found to have had any specific 
influence on their performances ($ > .1 in all 
cases). However, the rubella deaf, in compari- 
son with the nondeaf, scored fewer “normal” 


signs ( < .001) and more “schizophrenic” 


signs ( < .01). Six “normal” signs and three 
“schizophrenic” signs were found to differen- 
tiate these two groups (p < .05). 

Furthermore, a comparison of the deaf as a 
whole (rubella and nonrubella, V = 50) with 
the nondeaf revealed that the above signs 
could also distinguish these two populations 
(p < .05 in all instances). 

It is contended that the fantasy productions 
of the deaf, in general, are indicative of their 
social isolation and illogical unrealistic think- 
ing. 
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SHIFT IN JUDGMENT OF WEIGHTS AS A FUNCTION OF 
ANCHORING STIMULI AND INSTRUCTIONS IN 
EARLY SCHIZOPHRENICS AND NORMALS! 


KURT SALZINGER?® 
Columbia University 


HE literature concerned with investiga- 

tion of the variables which are effective 

in changing an absolute scale of judg- 
ment is a rich one (1, 3, 5, 6, 7, 9, 10, 13, 15, 
16) dealing with many varied types of stimuli 
which impinge upon the different sense recep- 
tors of the organism. The consistency of find- 
ings has led investigators to postulate a single 
judgment process to explain the formation of 
and shift in a scale of absolute judgment with 
normal subjects (Ss). 

In this study, differences in the judgment 
mechanism between normal Ss and schizo- 
phrenics were studied by using a modified 
form of the method of absolute judgment. The 
modification was employed because a pretest 
of the method indicated that in the presence 
of the anchor stimuli some Ss instructed them- 
selves to shift their judgments and some to 
keep them constant. The instruction variable 
was therefore controlled by telling Ss not to 
shift. This change in the administration of the 
method of absolute judgment made it possible 
to describe Ss’ judgments in the anchor condi- 
tion in terms of two conflicting discriminative 
stimuli. The first of these, a physical stimulus, 
was the anchor in conjunction with the test 
stimulus following it. The second, a verbal 
stimulus, consisted of the instructions of ex- 
perimenter (E). 

In terms of Goldstein’s conceptualization of 
schizophrenic thinking (2), reaction to the 
verbal stimulus may be viewed as an instance 
of abstract thought, whereas reaction to the 
stimulus characterizes concrete 


1 This investigation was supported in part by research 
grant M586C on the National Institute of Mental 
Health. The article is based on a dissertation submitted 
in partial fulfillment of the requirements for the Ph.D. 
degree in the Department of Psychology in the Faculty 
of Pure Science, Columbia University in 1954. The 
author is indebted to Professors J. Zubin, H. Garrett, 
C. G. Mueller, Dr. S. Kugelmass, and Mrs. S. Salzinger 
for their invaluable aid during the various phases of the 
dissertation on which this paper is based. 

2Now at Biometrics Research, N. Y. State Dept. of 
Mental Hygiene, Psychiatric Institute, 722 West 168 
Street, New York 32, N. Y. 
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thought. Amount of shift (i.e., the change in 
judgment from the unanchored to the anchor 
condition) can then be said to vary as a func- 
tion of the extent to which a given S reacts to 
concrete as contrasted to abstract stimuli. 

The shift phenomenon may be looked upon 
from another point of view, that of constancy. 
If the instruction variable is effective, then it 
serves to keep the judgment constant. Whether 
or not an anchor stimulus precedes it, the 
same judgment is evoked for a given stimulus. 
The extent to which S shifts then serves as an 
index of his inability to preserve constancy. 
In many instances, constancy is maintained 
when the response to abstract aspects of the 
stimulus or stimulus situation is prepotent 
over the response to the concrete aspects of 
the stimulus or stimulus situation, 

On the basis of this formulation, schizo- 
phrenics and normals were compared on their 
ability to respond to concrete versus abstract 
stimuli, or on their ability to maintain con- 
stancy. The central hypothesis was that schizo- 
phrenics are less able to maintain constancy 
or to react to abstract (verbal) stimuli than 
normals. 

Other studies have been made using similar 
techniques on patient groups. Sonder (14) 
found judgment of lifted weights to be prog- 
nostic in a study of schizophrenics. Kaplan 
and Helson (4) demonstrated differences in 
performance between lobotomized patients and 
normals in a pilot study on a weight judgment 
task. Zubin (18) used the same weight judg- 
ment technique as that employed in this study 
for a comparison of patients who leave the 
hospital with those who do not. He found that 
patients who stayed in the hospital shifted 
more than those who left. 

Rado’s (11) theory that proprioceptive 
disorder is an innate trait of schizotypal or- 
ganization gave rise to the second hypothesis 
of this study; namely, that schizophrenics 
make more incorrect judgments than normals 
in a task where judgments depend upon 
proprioceptive discrimination. 
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Mueller and Schumann (8) indicated that 
the lighter a weight, the higher and faster it is 
lifted, and that judgment of heaviness depends 
upon S’s response to the ease with which he 
overcomes each weight’s inertia. The design 
of this experiment allowed a retest of this 
contention as the third hypothesis. 

Both judgment time and height of lift of 
each weight were further explored as relevant 
parameters of the judgment situation. These 
variables were examined in order to elucidate 
any differences in shift found between nor- 
mals and patients. 


METHOD 
Apparatus 


The weight judgment apparatus consisted of a 
turntable which could be revolved in such a way as to 
make available one stimulus at a time to S, an arm 
rest allowing S to keep his hand in a comfortable and 
steady position, and seven weights of 100, 200, 250, 
300, 350, 400, and 900 gms. Each of the weights was 
encircled with lines marking one-cm. distances from the 
bottom so that there was a total of seven such lines. A 
buzzer-chronoscope circuit was so set up that the buzzer 
served as a signal for S to lift the weight, and the time 
from the signal to the time of judgment could be ob- 
tained as an index of judgment time. 


Subjects 


Sixteen acute schizophrenic patients (eight males and 
eight females) were randomly chosen from one male 
and one female ward of the New York State Psychiatric 
Institute. All patients were cooperative and in good 
contact. Sixteen normals (eight males and eight 
females), most of whom were chosen from the School of 
General Studies of Columbia University, were equated 
with patients (matched by pairs) as far as was practi- 
cally feasible in terms of sex and age. The mean age of 
the patient group was 27.6 years (range from 19 to 38 
years), while that of the normal group was 27.2 years 
(range from 19 to 39 years). Patients ranged in educa- 
tional status from one term in high school to the Ph.D. 
while the education of the normals ranged from one 
year of college to the Ph.D. The diagnoses of the 
patients were: 3 schizophrenic undifferentiated, 6 
schizophrenic paranoid, 2 schizophrenic mixed, 3 
schizophrenic catatonic, 1 schizophrenic simplex, and 1 
schizophrenic pseudoneurotic. Patients participated in 
the experiment as part of their regular routine while in 
residence in the hospital. Normals were paid for their 
participation in the experiment. Unfortunately, two 
normal Ss were presented with the experimental con- 
ditions in incorrect order and had, therefore, to be 
discarded from the sample in the second session. 
When normal patient comparisons were made, the 
patients matched with the discarded normals were 
left out of the analysis. 


Experimental Procedures 


Both normals and schizophrenics underwent the 
same experimental procedures. The E ascertained from 
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each S whether he engaged in an occupation involving 
weight estimation (no such S was found) and whether 
he was right or left handed (S was subsequently re- 
quested to use his dominant hand). All Ss were pre- 
sented with the following instructions upon entering 
the experimental room: 

This is a weight judgment task. I am going to 
ask you to judge the heaviness of five weights in 
comparison to each other by calling the heaviest 
“very heavy,” the next heaviest “heavy,” the next 
one “medium,”—that’s the middle weight—the 
next one “light,” and the lightest one “very light.” 

Now suppose you place your arm in this arm rest 
(right for right-handed, left for. left-handed). Right 
underneath your hand you will find the knob of a 
weight. When I give you the signal like this (sound 
buzzer), you will lift the weight approximately one 
inch from the table top, set it down, and give me 
your judgment of it. Now just to show you the cor- 
rect judgments, I’ll present the weights to you in 
order, telling you which judgment applies to each of 
the weights. This is the very heavy weight (sound 
buzzer), this is the heavy one (buzzer), this is the 
medium one (buzzer), this is the light one (buzzer), 
and this is the very light one (buzzer). 

This procedure familiarized S with the proper 
categories and trained S to respond to the buzzer signal 
and to lift the weight the required distance by means 
of his thumb and two neighboring fingers. After this 
initial training period, E made no additional com- 
ments concerning S’s height of lift. At the end of 
the training period Æ continued: 

O.K.? Now you have lifted all the five weights. From 

now on I’ll present the weights to you ina mixed up 

order and you'll give me your judgment of the 
heaviness of the weight in terms of very heavy, 

heavy, medium, light, and very light. O.K.? Vl 

sound the buzzer, you lift the weight, set it down 

and give me your judgment of it. 

The Æ then presented S with the five stimulus 
weights (200, 250, 300, 350, and 400 gm.) five times each 
in the pattern of a Latin square. Each weight was thus 
presented in a different position in each series of five 
weights. This unanchored condition was presented to 
each S at the beginning of the experiment in exactly the 
same way. Four male and four female normals and 
their matched patient Ss were then presented with 4 
heavy anchor; the rest of the Ss were presented with a 
light anchor. The instructions for this anchor condition 
were: 

From now on I’m going to ask you to lift an ae 

tremely heavy (light) weight before lifting each O 

the weights that you are to judge. Don’t bother to 

judge the first weight and make sure that its heavi- 
ness does not affect your judgment of the secon 

weight. O.K.? 

In this condition, the weights were presented to S 
in exactly the same order as in the unanchored condi- 
tion, except that S was required to lift the anchor before 
each weight that he judged. 

At the end of the anchored session, S was allowed to 
remove his arm from the arm rest, and Æ answered 
questions (without disclosing the purpose of the 
experiment) for a period of approximately five minutes: 
When the rest period ended, Æ trained S as in tbe 
beginning of the experiment, repeated the instruction, 
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and then presented S with the unanchored condition. 
At the end of this period, those Ss who had been 
presented with the heavy anchor before the rest period 
were presented with the light anchor, while those who 
had been presented with the light anchor before the 
rest period were presented with the heavy anchor. This 
second anchored condition was presented to S with the 
same instructions as before. 

Fourteen patients and 11 normals were retested after 
a lapse of two days; the other 2 patients were retested 
after 4 days, while 2 of the normals were retested after 3 
days, 1 after 1 day, 1 after 4 days, and 1 after 8 days. 
All Ss were retested with the same procedure except 
that the order of the anchor conditions was reversed. 
The Ss who had previously been given the heavy 
anchor first were now given it second, and vice versa. 

The test weights in the unanchored condition and the 
anchor weights in the anchored condition were pre- 
sented to each S approximately cight seconds after the 
preceding judgment; the test weights in the anchored 
condition were presented approximately four seconds 
after S set the anchor weight down. 

During the eight-second period, Æ recorded the 
judgment of S, the height of lift correct to the nearest 
centimeter, and the decision time correct to the nearest 
-01 of a second. 

The decision time as measured included the time 
between the sounding of the buzzer and S’s lifting of 
the weight, Æ’s reaction time to S’s vocal judgment 
(disconnecting the chronoscope circuit), and S’s 
decision time proper (the time from lifting the weight to 
the judgment). Since the reaction times involved were 
much shorter than the whole decision time, however, the 
latter could still be used for comparative purposes as it 
was in this experiment. 

None of the Ss was aw 
recording either their decision time or 
lift. r 5 
The raw data were converted to stimulus shift scores 
and kinaesthetic scores as described in detail by 
Salzinger (12). These scores were based on the difference 
in grams between the weight to which a particular 
judgment was assigned and the one to which it should 
have been assigned according to the verbal instructions 
given during the preliminary training period. 


are of the fact that Æ was 
their heights of 


RESULTS 


Shift 

Combining the shift scores of both test and 
retest sessions, one shift score was obtained 
for the heavy anchor and one for the light 
anchor for each S. A comparison of normals 
and schizophrenics using Wilcoxon’s paired 
replicates test (17) showed that the latter 
shifted significantly more than the former 
(.02 <p <.05) in the heavy anchor condition 
but did not differ from the normals in the light 
anchor condition (p > -05). 

For the first session alone, 
the same. The schizophrenics 
than the normals (.01 <p < 
unanchored to the heavy anc 


the results were 
shifted more 
02) from the 
hor condition, 


but no difference was found from the unan- 
chored to the light anchor condition (p> 
.05). For the second session alone, there were 
no significant differences between normals and 
patients for either anchor. 

Since there was a significant difference in 
shift between normals and patients in the first 
session but not in the second, the changes 
from the first to the second session were ex- 
amined. Wilcoxon’s paired replicates test 
demonstrated a significant (p = .05) increase 
in shift due to the heavy anchor for normals 
from the first to the second session. There was 
a nonsignificant trend in the opposite direction 
for the patients. No significant differences in 
shift were found for the light anchor conditions. 
The contrary changes in both groups were 
apparently responsible for the elimination of 
the significant difference between them. 

Change in relative status of each S from 
the first to the second session was examined in 
terms of rank order correlation coefficients 
between shift scores. Statistically significant 
correlations (p<.05 by a one tail test) were 
found for the shift due to the heavy anchor of 
both normals (V = 14, rho = +.49) and 
patients (V = 16, rho = +.44). On the other 
hand, correlations for the light anchor condi- 
tion for both normals (rho = +.11) and 
patients (rho = +.33) showed no such rela- 


tionship. 


Kinaesthesis 

There was no statistically significant dif- 
ference (p > .05) by Wilcoxon’s (17) paired 
replicates test between normals and patients 
in their kinaesthetic ability. This was tested by 
comparing the kinaesthetic score (total error 
score in grams) of the patients and their con- 
trols on all unanchored conditions. Compari- 
son of the change in discriminative ability from 
the unanchored to the anchored conditions in 
normals with that of the patients also revealed 
no statistically significant difference. 


Decision Time 


Decision time data were tabulated separately 
for each session, for each weight, and for each 
condition (unanchored, heavy anchor, and 
light anchor). Median values were then com- 
puted separately for the normals and patient. 
Results are shown in Table 1. 

Wilcoxon’s groups-of-replicates test indi- 
cated that the decision time of the patients 
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TABLE 1 
Mept1an Decision TIME (IN SECONDS) OF Junc- 
MENTS OF NORMALS AND PATIENTS FOR FIVE 
DIFFERENT WEIGHTS UNDER CONDITIONS 
or No Ancuor (NA), Heavy ANCHOR 
(HA), AND LIGHT Ancnor (LA) 
In Two SESSIONS 


Normals Patients 

Ses- | Weights 
ne NA | LA | NA | ĦA | NA | LA | NA | HA 
1 200 3.904.443 .864.13|4 0614.564.363 .83 
250 \4.43 4.2414 .05|4.26|5 17|4.67|4.42|4.09 
300 |4.42/4.32|3.88/4.30 5.36/4.81 4.88|4.79 
350 |3.88|4.41/4.01/4.08/4.86/4.26/4.71|4.46 
400 ike, A R A 
2% 200 3.994.283.834. 18/3 . 70/3 .83|3 .56|3 . 59 
250 4.61/4.37/4.51|4.59)3.71]3.863.89/4.01 
300 4.734.694.5113 .86'4.15/4.03/4.41 4.25 
350 |4.38 4.464.47 5.013.863 .98 4.34/4.18 
400 |4.07 4.20|5.02 4.98'3.73 3.51|3.76)4.19 


* There were 16 normals in the first session, 14 in the second 
session, and 16 patients in both sessions. Each S was presented each 
weight five times in each condition. 


was significantly longer (p < .01) than that of 

the normals in the first session and significantly 

shorter (p < .01) than that of the normals in 
the second session. 

Significant interaction, determined by a chi- 
square analogue of an analysis of variance in 
Wilcoxon (17), was found between groups 
(patient vs. normal) and experimental con- 
ditions (anchored vs. unanchored) in the first 
session only, i.e., the largest differences in 
decision time appeared in the unanchored 
condition. The group-condition interaction 
in the second session and the group-weight 
interaction in both the first and second sessions 
were nonsignificant. 

Analysis of the change in decision time from 
the first to the second session showed a sig- 
nificant (p < .01) increase in decision time for 
normals and a significant (p < .01) decrease 
for patients by Wilcoxon’s paired replicates 
test. 

Session-condition interaction was signi- 
ficant (p < .05) for the patients, the heavy 
anchor condition showing the smallest change 
from the first to the second session; session- 
condition interaction for normals and session- 
weight interaction for both normals and pa- 
tients were not significant. 

Differences between conditions were tested 
for by a chi-square analogue of analysis of 

variance. A significant difference was found for 
patients between conditions in the first ses- 
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sion (p < .05) but not in the second. No sig- 
nificant differences were found for either of 
the two sessions of normals. In the first session, 
patients showed the longest decision times in 
the two unanchored conditions, the shortest 
in the heavy anchor condition, and the next 
to the shortest in the light anchor condition. 
Differences between weights were tested by 
the same technique and yielded no significant 
differences for either session of normals. For 
patients, significant differences were found 
between the decision times for different 
weights (p < .05 for first and second sessions). 
The longest decision time was associated with 
the middlemost weight in the series, the 300 
gram weight, with increasingly shorter decision 
times for weights farther from the middlemost. 


Height of Lift 


The accuracy of E’s estimate of height of 
lift was determined by having a person lift a 
weight to a predetermined height by means of 
a pulley system and then recording Æ’s esti- 
mate of height. Ten per cent of the heights were 
incorrectly judged by Æ. No error exceeded -5 
cm. 

Height of lift data were tabulated separately 
for each session, for each weight, and for each 
condition (unanchored, heavy anchor, and light 
anchor). Median values were then computed 
separately for normals and patients. The re- 
sults are presented in Table 2. 

The height of lift data were subjected to the 


TABLE 2 
Mepran HEIGHTS or Lirr (IN cm.) or WEIGHTS 
or NORMALS AND PATIENTS FoR I'IvE DIFFER- 
ENT WEIGHTS AND UNDER CONDITIONS OF 
No Awncnor (NA), Heavy ANCHOR 
(HA), and Licur AncHor (LA) 
IN Two SESSIONS 


Ses- " Normals Patients 
sions Weights 
NA | HA | NA | LA | NA | HA | NA | LA 
1 200 |3.3| 2.0| 2.9 3.5| 4.3| 3.5| 4.0| 4.0 
250 | 3.0] 1.7| 2.9| 3.2| 4.4| 3.4| 4.4| 4.2 
300 | 2.9] 1.8) 2.5] 3.2| 4.3] 3.5] 4.2) 4.2 
350 | 3.2| 2.0] 2.7| 3.6| 4.5] 3.8] 4.4] 4.2 
400 | 2.8] 2.1| 3.2] 3.0] 4.2| 3.6] 4.4] 4.4 
2* | 200 | 3.1] 2.3) 2.7} 3.5] 4.6] 3.4| 4.0} 4.6 
250 | 2.6! 2.3] 2.5| 3.2| 4.4] 3.6| 4.3] 4.9 
300 | 2.7| 2.4] 2.4} 3.3] 4.3] 3.9] 4.4] 4.3 
350 | 3.3] 2.6| 2.6] 3.4| 4.2| 3.7| 4.4] 4-4 
400 | 3.2) 2.4 2.5] 3.7| 4.4] 3.6] 4.2] 4-4 


* There were 16 normals in the first session, 14 in the second, and 
16 patients each in both the first and second sessions. 
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same kind of analysis as the decision time data. 
Wilcoxon’s groups-of-replicates test indicated 
that the patients lifted their weights signifi- 
cantly (p < .01) higher than the normals in 
both sessions. 

Group-condition interaction was significant 
at the .005 level for the first session and at 
the .03 level for the second session. In both 
sessions, the light anchor condition consistently 
produced the smallest differences between 
patients and normals. Group-weight inter- 
action was not significant for either session 
($ > 05). 

Height of lift did not change significantly 
(p > .05) from the first to the second session, 
nor was there a significant session-weight or 
session-condition interaction. 

Analysis of differences between conditions 
and between weights by a chi-square analogue 
of analysis of variance (17), done separately 
for each session of normals and patients, 
yielded significant (p < .05) differences be- 
tween conditions but not between weights. The 
heavy anchor condition consistently resulted 
in the lowest height of lift for both normals 
and patients. Among normals and in the second 
session of the patients, the light anchor con- 
dition resulted in the greatest height of lift, 
the unanchored condition falling in the middle 
between the light and heavy anchor condition. 
In the first session, the patients exhibited the 
second greatest height of lift in the light anchor 
condition and the greatest height of lift in the 
unanchored condition. 

The relationship between the degree of shift 
and the change in decision time from the un- 
anchored to the anchored conditions separately 
for normals and patients in each session and 
for each anchor was explored as a factor rele- 
vant to the judgment situation. Examination 
of the relationship yielded no significant rank- 


order correlations. 


Discussion 

The central hypothesis that schizophrenics 
shift more than normals was confirmed for the 
heavy anchor condition. The ineffectiveness of 


the light anchor compared to the heavy anchor 
may perhaps be accounted for by its being 
only 100 gms. below the lightest test weight, 
whereas the heavy anchor was 500 gms. above 
the heaviest test weight. 


The instability of the differential effect of 


the heavy anchor on normals and patients 
from test to retest required a more exact 
examination. 

Analysis of the change in shift due to the 
heavy anchor showed that the lack of difference 
between normals and patients in the second 
session was primarily due to a change in the 
normals. They shifted significantly more in 
the second than in the first session, thus nar- 
rowing the initial large difference in shift 
between the two groups. 

The correlations between test and retest 
showed that the shifts due to the light anchor 
were not reliable, while the shifts due to the 
heavy anchor for both patients and normals 
showed reasonable reliability ( < .05). 

The results of the analysis of shift may be 
further clarified by exploring the relationship 
of decision time to shift. The change in decision 
time from the first to the second session paral- 
leled that of shift; there was an increase 
in decision time for normals and a decrease for 
patients. Both these changes were significant. 
Further examination made it clear that dif- 
ferences between normals and patients were 
largest in the unanchored conditions. From 
these results it would appear that resistance 
to the anchor may be sustained only at the 
expense of a longer decision time than is other- 
wise necessary. This interpretation is consistent 
with the fact that in the first session, where the 
patients shifted more, their decision times 
were shorter for the anchored than for the 
unanchored conditions, while in the second 
session, where no significant differences in 
decision time occurred between the anchored 
and unanchored conditions, the patients 
shifted less. This notion applies to the first 
session for the normals, where there was no 
significant difference in decision time between 
anchored and unanchored conditions with 
relatively great resistance to shift, but does not 
seem applicable to the second session for 
normals, where slight resistance to shift was 
accompanied by nonsignificant differences 
between anchored and unanchored conditions. 
Furthermore, correlations of amount of shift 
with decision time within the group of normals 
and within the group of patients were not 
significant. 

The height of lift data did not shed any 
light on the change in amount of shift from 
the first to the second session since neither 
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normals nor patients showed differential 
changes from one session to the other. 

It now remains necessary to explain the 
instability in the normal-patient difference 
in ability to shift. Since only the normals 
changed significantly in their shift scores, it 
may be that their motivation decreased from 
the first session to the second. While they 
followed instructions closely in the first session 
because of their initial curiosity, they later 
became bored and had difficulty resisting the 
anchor; consequently, they took longer for 
their decisions. On the other hand, patients 
who showed decreased decision time, but no 
significant change in shift, may be supposed to 
have been as highly motivated to do well in 
both sessions but more anxious to get through 
with the task because of its potential threat 
(to them). This was evidenced by the fact 
that the patients showed concern during 
both testing sessions over whether their per- 
formance would have an effect on their re- 
maining in the hospital in spite of all 
reassurances to the contrary by E. 

Since the correlations between the shift 
scores of the two sessions were reliable though 
low the above interpretation is offered as a 
tentative explanation for the lack of difference 
in shift between normals and patients in the 
second session. 

A comment might be made here concerning 
the low correlations. In general, the psycho- 
physical techniques are used only after in- 
tensive training of highly motivated Ss who 
have achieved high reliability in their judg- 
ments. When applying psychophysical tech- 
niques in an experiment involving patients, 
however, it becomes necessary to keep the 
task sufficiently short to hold the patients’ 
attention. Repetition of the task on successive 
days, which in normals results in highly stable 
performance (learning has reached an as- 
ymptote), may encounter the typical fluctu- 
ations in the pathology of patients. It may be 
of interest to examine this problem by sub- 
jecting both patients and normals to longer 
training sessions. 

The second hypothesis tested in this experi- 
ment, that patients have poorer kinaesthetic 
sense than normals, was not confirmed. Thus, 
while patients may still show poorer pro- 
prioceptive sense than normals in areas other 


than the kinaesthetic one, this experiment 
would seem to call such a contention into 
question. 

As far as the Mueller-Schumann theory was 
concerned, the evidence was inconsistent with 
the notion that height of lift is a cue for judg- 
ment. Neither the normals nor the patients 
lifted the weights to differential heights, but 
they were still able to discriminate the stim- 
uli. 

Exploration of other factors relevant to the 
judgment situation revealed a consistent dif- 
ference between normals and patients in the 
relative amount of time devoted to the judg- 
ment of each weight. The normals appeared to 
use the same amount of time for each weight, 
while the patients took the longest time for the 
middlemost weight with decreasing decision 
times for heavier and lighter weights. 


SUMMARY 


A weight judgment task was administered to 
16 schizophrenic patients and 16 normals for 2 
sessions. The two groups did not differ from 
each other in ability to discriminate between 
the weight stimuli, but the schizophrenics 
shifted significantly more than the normals in 
the heavy anchor condition, thus providing 
evidence for the theory that schizophrenics are 
more prone to react to concrete than to ab- 
stract stimuli, as well as for the theory that 
schizophrenics are less able to maintain con- 
stancy in weight judgment than normals. 
Measures of decision time and height of lift 
were also obtained and used to throw light 
upon the differences in shift. 
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AN ANALYSIS OF INTERPERSON CORRELATIONS AMONG 
THIRTY PSYCHOTICS! 


ROBERT J. WILLIAMS? anp VINCENT S. MACHT? 
Columbia University 


interest in the development of prognostic 

criteria in mental disorders. For the most 
part, early attempts in this area were con- 
cerned with the identification of prognostic 
factors of a clinically observable nature (3, 7). 
Predicting the outcome of psychotic disorders 
from psychological test performance is, by 
comparison, a fairly recent development. 

Windle (11), in a review of the literature on 
the prognostic use of psychological tests, has 
observed that a majority of studies on the 
prognostic value of various psychological tests 
report that good test performance is positively 
correlated with outcome. However, he also 
noted that a number of studies report an in- 
verse relationship between test performance 
and outcome. The apparent contradictions 
were resolved when it was demonstrated that 
good performance on the part of patients with 
an acute (short duration) psychosis, and poor 
performance on the part of patients with a 
chronic (long duration) psychosis were both 
related to favorable outcome (12, 15). 

A test of the prognostic value of psycholog- 
ical measures for recovery from psychosis must 
take account of those conditions which inter- 
vene between test observations and outcome. 
It is pertinent, for example, to consider the 
relationship between therapy and outcome. An 
evaluation of the contribution of therapy to 
outcome must be based on the comparison of 
spontaneous remission rates with remission 
rates following the various types of therapy. 
There is some agreement that special therapies 
produce remission rates no higher than good 
custodial care (9, 15). However, Zubin (13) has 
found that special therapies hasten improve- 
ment in some cases, although in the long run 
the remission rate may be the same as that 
following custodial care. Windle (11) concludes 
that, with respect to prognosis, individual fac- 
tors are probably more important than type of 
therapy. 

1 Portions of this article were presented at the annual 


American Psychological Association convention, Sep- 
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Another intervening condition which must 
be considered in evaluating prognostic indices 
is the time which is allowed to elapse between 
the initial psychological test observations and 
the follow-up observations. Windle and Hamwi 
(12) noted that a considerable amount of the 
confusion surrounding prognostic indices may 
be attributed to the neglect of some observers 
to report the time of follow-up, as well as the 
duration of illness at the time of prognostic 
testing. Very short-term follow-ups are un- 
likely to offer a clear indication of the ultimate 
status of the patient, and of course will fail to 
take into account the possibility of readmis- 
sion. 

An opportunity to relate a variety of psy- 
chological test performances to a sufficiently 
distant follow-up status was provided by the 
psychological test and follow-up data of the 
Columbia-Greystone study (6). This investiga- 
tion was designed originally to measure the 
effects of selective partial ablation of the fron- 
tal cortex. In the course of the study a great 
variety of psychometric observations were 
made on a number of chronic psychotic pa- 
tients. Some were operated and some were 
observed as controls. Tests were made both 
pre- and postoperatively. The patients are still 
under follow-up observation. 

In the present study, factorial methods weré 
applied to the interperson correlations among 
30 chronic psychotic patients, in order (a) tO 
determine if, among chronic psychotics, there 
exists a “psychometric type” with a high 
probability of recovery, and (b) if so, to der 
scribe this “psychometric type.” 

The existence of a “recovery type” has bee? 
suggested by several investigators in this area. 
Balinsky (1), and Lourie, Pacella, and Pio- 
trowski (5) argue that when scores on perform- 
ance tests exceed scores on verbal tests, 
prognosis is more favorable than when the 
reverse relationship exists. Pfeffer (8), Zubi» 
and Windle (14), and Zubin, Windle, and 
Hamwi (15) have hypothesized that chronic 
patients who tend to be conceptually clear bu 
perceptually confused have poor prognosis 
These two hypotheses are not necessarily co” 
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tradictory and may simply be alternative ex- 
pressions of the same idea. Although the 
number of subjects examined in the present 
analysis is small, and the population from 
which they are drawn is difficult to define, it 
is hoped that this study may contribute a little 
to the clarification of these notions. 

Because the conditions under which the 
findings presented here could not readily be 
replicated, the results must be conservatively 
Interpreted. However, an exhaustive analysis 
of a small number of patients may add some- 
thingto knowledge in this area. 


METHOD 


Subjects 


Thirty chronic psychotic patients of the Columbia- 
Greystone study were used in this analysis. Table 1 
presents the age, sex, and date of first hospital admis- 
sion of each patient. For the purposes of the original 
investigation, some of the patients were given special 
therapy in the form of psychosurgery. The last column 
of Table 1 shows whether or not each patient received 
this treatment. The patients are identified by the same 
code numbers as those used to identify the patients in 
the original Columbia-Greystone study. In the table, 
the patients are divided into two groups: those who 
were judged “recovered” by a criterion described 
below, and those who were judged “nonrecovered.” 

A “chronic” psychosis is defined, for the purposes of 
the present study, as one of two years’ duration or 
longer. All of the present subjects (Ss) had been hos- 
pitalized for at least two years at the time of testing 
with a mean duration of 1034 years. À 

Nearly all Ss for whom reasonably complete histories 
could be obtained showed evidence of prehospitalization 
disturbances of behavior, either in the form of poor 
social adjustment (extremely labile work histories, 
unsatisfactory marital adjustment ending in divorce or 
separation, or alcoholism), or in the form of abnormal 
episodes (reported as “convulsions,” “fits,” “tan- 
trums,” “depressions” or attempts at suicide). Ten of 
the Ss had records of hospitalization for behavior 
disturbances prior to their present admission. The avail- 
able personal history data for each patient are reported 


by Mettler (6, pp. 315-389). 


Psychometric Materials 
Sixty-six measures of psychological test performance 


were made on each patient. These measures were made 
as a part of the Columbia-Greystone project during the 
Spring of 1947. Only the initial or preoperative meas- 
ures were used in the present study. A summary list of 
the measures by name is presented in Table 2. Com- 
plete details of each test and scoring procedures are 


available in the literature (6, 12, 15). 


Procedure 
The major steps in the analysis of the data were (a) 


Preparation of a matrix of interperson correlations for 
all thirty patients, (b) factorial analysis of the matrix, 


TABLE 1 
DESCRIPTIVE DATA For 30 CHRONIC Psycuoric 
PATIENTS OF THE COLUMBIA-GREYSTONE 


Srupy 
Date First | ‘at 
Patient Code Atenin, E 
Number oma Tine Sex |Psychosurgery 
Study 
“Recovered” 
Patients 

2 Nov., 1938 | 46 M | Operated 

4 1934 55 M | Operated 

5 Feb., 1945 35 M | Operated 

7 Dec., 1942 40 M | Operated 

1 Jan.,1943 | 44 | M | Operated 
12 Nov., 1940 37 M | Nonoperated 

13 Oct., 1944 43 M | Operated 

19 Mar., 1933 50 M | Operated 

21 Nov., 1926 42 M Operated 

38 1925 38 F Operated 
39 Nov., 1924 44 F Nonoperated 

“Nonrecovered” 
Patients 

1 Dec., 1915 55 M | Nonoperated 

3 July, 1940 61 M | Operated 

6 Nov., 1940 32 M | Operated 

8 Feb., 1942 29 M | Operated 
9 1928 44 M | Nonoperated 
14 Jan., 1933 47 M | Nonoperated 
17 Nov., 1904 60 M | Nonoperated 

18 July, 1943 39 M Operated 

23 June, 1942 39 M | Operated 

25 Aug., 1945 58 M | Operated 
26 June, 1944 35 M Nonoperated 
29 May, 1945 20 M | Nonoperated 

31 June, 1941 | 51 F | Operated 

32 June, 1934 41 F Operated 
35 Nov., 1940 59 F Nonoperated 

36 Apr., 1945 | 53 F | Operated 

40 June, 1943 29 F Operated 

42 July, 1928 40 F Operated 

47 Jan., 1942 31 F Operated 


(c) identification of the factors in terms of test per- 
formances, and (d) determinations of the relationship 
of the factors to recovery and of the interrelationships 
of psychosurgery, age, and sex with recovery. These 
steps are outlined in detail below. 

Preparation of the matrix. The distribution of raw 
scores for each of the 66 measures was dichotomized at 
the median. Each patient was classified as scoring 
either above or below the median on each measure. 
From these classifications, a matrix of coincident above- 
median scores for all possible pairs of patients was 
prepared. This matrix was converted into a correlation 
matrix by converting the coincidence scores and the 
number of above-median scores for each patient into 
percentages, and entering the tables of tetrachoric 
correlation coefficients (4) with these values. The use of 
the tetrachoric correlation coefficient involves several 
assumptions which cannot be verified in the present 
data. However, the tetrachoric coefficients were found 
to agree very closely with several other coefficients 
which were computed for portions of the data and 
closely duplicated the hierarchical order in the original 
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TABLE 2 
PSYCHOLOGICAL TEST MEASURES 


Test Code Number 


Test Measure 


Oonan une 


Word Association Total Reproduction Reaction Time 
Word Association Test Reaction Time 
Word Association Correct Recall 
Word Association Correct Recognition 
Word Association Inaccurate Reproduction 
Rorschach Total Responses 
Rorschach Whole Responses 
Rorschach Detail Responses 
Rorschach Form Responses 
Rorschach Movement Responses 
Rorschach Color Responses 
Rorschach Reaction Time 
Rorschach Ratio 
Levy Movement Cards, Scale 3 (Seeing Persons) 
Levy Movement Cards, Scale 4 (Seeing Movement) 
Anxiety Inventory, Full Scale 
Anxiety Inventory, Self Descriptive Scale 
Complaint Inventory, Full Scale 
Social Ethical Attitudes 
Mirror Drawing Blocking 
Mirror Drawing Frustration 
Complex Reaction Time Test Confusion 
Wechsler Bellevue General Information 
Wechsler Bellevue General Comprehension 
Wechsler Bellevue Digit Span 
Wechsler Bellevue Similarities 
Wechsler Bellevue Vocabulary 
Wechsler Bellevue Picture Arrangement 
Wechsler Bellevue Picture Completion 
Wechsler Bellevue Block Design 
Wechsler Bellevue Object Assembly 
Wechsler Bellevue Digit Symbol Substitution 
Wechsler Bellevue Full IQ 
Porteous IQ 
Porteous MA 
Porteous Q Score 
Complex Reaction Time Test, Number of Correct Solu- 
tions 
Complex Reaction Time Test, Number of Errors 
Albert Test, Number Correct 
Analogies Test, Number Correct 
Essential Differences Test, Number Correct 
PI Sorting Test, Part 1, Number Grouped 
PI Sorting Test, Part 2, Verbalized Groups 
PI Sorting Test, Part 3, Isolation of Single Items 
Weigl Test Performance 
Capps Homograph Test, Verbal Flexibility Score 
Benton Visual Retention Test, Number Correct 
PI Sorting Test, Part 4, Recall 
Memory For Objects, Recall 
Memory For Objects, Recognition 
Subtraction Time 
Addition Time 
Critical Flicker Frequency 
Time Judgment, Total Discrepancy 
Time Interval Test, Reproduction Discrepancy 
Time Interval Test, Verbal Estimation Discrepancy 
Time Interval Test, Operative Estimation Discrepancy 
Time Interval Test, Discrepancy sans Sound 
Time Interval Test, Discrepancy with Sound (Buzzer) 
Mirror Drawing, Total Sides 
Mirror Drawing, Error 
Meaningful Paired Associates, Number of Learning 
Trials 
Semi-meaningful Paired Associates, Number of Learn- 
ing Trials 
Meaningful Paired Associates, Variability in Learning 
Semimeaningful Paired Associates, Variability in 
Learning 
Verbal Directions, Rate of Learning 


coincidence scores. Ease of computation therefore 
favored the use of the tetrachoric coefficients.* 

The factor analysis. The matrix of interpatient cor- 
relations was factored, using Thurstone’s complete 
centroid method with the highest correlation in each 
column or row entered as an estimate of the com- 
munality for each patient. Factoring was terminated 
when the matrices of residuals met Tucker’s criterion 
(2, p. 298). 

At this point, criterion data were obtained for all 
patients. The criterion is based on the hospital status of 
patients as of May 5, 1954. On this date, of the 30 
patients considered in this study, 17 were hospitalized, 
8 were discharged, 4 had died, and one patient had 
been deported. Of the four who died, one died while 
hospitalized, and three died after discharge. The patient 
who died in the hospital died on May 22, 1952 after 19 
years of hospitalization. This patient was assigned a 
hospital status of “Hospitalized.” Of the three patients 
who died after hospital discharge, one (patient 4) died 
on April 9, 1952 after 414 years of nonhospitalization, 
another (patient 13) died January 12, 1954 after 614 
years of nonhospitalization, and the third (patient 19) 
died December 4, 1949 fifteen months after leaving the 
hospital. Each of these three patients was assigned a 
hospital status of “discharged.” The deported patient 
(patient 9) was assigned a hospital status of “hospital- 
ized” on the basis of his deportation from Greystone 
in July, 1951, because of ‘mental illness.” Thus, with a 
criterion of hospital status as of May 5, 1954, the as 
a total of 11 “discharged” or “recovered” patient , and 
a total of 19 “hospitalized” or ‘“nonrecovere ” natients- 
In Table 1, patients are classified as “ret sred” and 
“nonrecovered” by this criterion. = 

An orthogonal rotation of the original factors was 
then carried out so as to maximize the projection of the 
centroid of the “recovered” group on Factor J, while 
minimizing its projection on all other factors. 

Identification of the factors. Patients were selected to 
represent the extreme positive and extreme negative 
ends of each factor. Care was taken to select these p3- 
tients so that their average loading on factors other than 
the one they were selected to represent would be a 
minimum. Above-median performance was assigned a? 
arbitrary value of +1, and below-median performance 
an arbitrary value of —1. The performance of each 
selected patient on each of the 66 measures was then 
weighted by his loading on the factor which the patient 
was chosen to represent, and the products were summe 
over patients for each measure. These summed welg ts 
were used to rank the measures as contributors to aC 
factor, so that the factors could then be described in 
terms of the tests. The factor of principal interest in 
present study is the one which was rotated to pass 
through the centroid of the “recovered” group- 
other factors were described only for the purpose K 
clarifying the description of this “recovery factor- 
These additional descriptions serve the purpose i 
showing what the “recovery factor” is not. 

Interrelations of criterion data. The rotation of one 
of the factors to a criterion of maximizing the contrast 
between recovered and nonrecovered patients tends 
capitalize on chance factors which may contribute 
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has suggested that the coincident above-median e y 
corrected for chance, might have been factored direct’ 


r 


A 


ANALYSIS OF INTERPERSON CORRELATIONS AMONG THIRTY PsycHorTics 53 


TABLE 3 
INITIAL AND ROTATED FACTOR Loapincs or THIRTY CHRONIC Psycuotic PATIENTS 
UNROTATED ROTATED 
I I II IV I II Ul IV h? 
Recovered 

2 —.340 —.093 246 731 360 2 
i F x : -289 271 .658 720 
f -508 -513 O44 -293 .683 —.308 —.043 —.213 .609 
3 -581 -184 .052 -289 .469 —.486 .002 —.029 .458 
i .383 —.264 —.046 —.160 —.200 —.449 —.029 —.036 .244 
a — .046 .382 215 -052 312 .171 167 —.207 .197 
13 176 —.312 464 .520 ` .216 —.270 -491 .503 .614 
‘ -6388 —.238 —.134 169 090 —.681 —.131 146 .510 
a = 230' -582 150 —.254 197 .411 .086 —.513. .478 
A — .322 -437 .191 —.243 .084 .449 .149 —.399 390 
: — .180 -051 —.156 151 -016 .187  —.748 .188 630 
Noa 39 : —.591 .375 —.190 —.020 .084 (682 —.209 —.101 .526 

Tecovere 

1 —.556 —.444 401 —.064 —.451 376 478 313 .671 
3 —.506 —.262 —.253 —.437 — .623 390 —.195 —.032  .579 
6 408 —.195 —.435 —.201 —.212 —.450 —.425 —.081 .435 
8 189 144 414 145 -285  —.130 .384 —.059 .249 
9 —.321 —.561 —.112 -138 —.392 .115 —.026 „531 .450 
n = ae - ue = Pa = 26 = 2 .151 —.175 .139 380 
-. 3 k 355 404 553 -048 .063 527 
18 — .462 .044 —.222 348 125 .450 —.205 .354 .385 
23 —.394 —.240 .038 —.134 — .350 291 085 131 .232 
25 .720 433 307 176 .628 —.534 .218 —.321  .830 
26 .407 —.158 —.452 172 .059 —.436 —.446 177424 
29 400 —.127 .211 —.382 —.222  —.419 .207 —.314 .366 
31 «223 .215  —.281 —.261 .004 —.138 .243 
32 —.558 —.189 —.027 —.475 — .588 -463 574 
35 —.289 —.142 -O41 -056 —.130 :225 .109 
34 -309 -702 .049 —.217 .429 —.057 638 
40> .383  —.320 197 —.370 —.355 —.468 425 
42 — .047 341 — .225 +113 «219 -158 À 3 182 
47 654 —.162 —.267 141 .115 —.671  —.273 -086 .545 


that contrast. Therefore, no genuine test of the sig- 
nificance of the relationship between this factor and 
“recovery” is possible. However, White’s rank order 
test (10) was applied with this serious reservation. 
The possibility that the “recovery factor” might be 
an artifact of age or sex differences was examined by 
determining the relation of these variables to the factor 
loadings, using White’s rank test for sex and the 
rank-difference correlation for age. (An examination of 
the descriptive data in Table 1 shows that the “re- 
» criterion used in this study is unrelated to 


covery 
x, or psychosurgery.) 


duration of illness, age, se: 


Resutts* 
acted from the matrix 


Four factors were extr: € 
The matrix of re- 


of interperson correlations. 


too bulky to be reproduced 


í The following tables, 1 
the American Docu- 


here, have been deposited with c 
mentation Institute: (a) The above- or below-median 
each of the 66 measures, 


classification of each patient on € 
(b) the matrix of coincident above-median scores, (c) 
the matrix of initial intercorrelations, (d) the matrix of 
residuals, (e) the transformation matrix for rotating 
the factors, and (f) a table showing the rank, for each 
factor, of each of the 66 measures. Order document No. 
5197 from the ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress, Wash- 


siduals after the extraction of these four factors 
did not warrant the extraction of additional 
factors according to Tucker’s criterion. These 
factors were then rotated as described above 
so that the first factor would pass through the 
centroid of the “recovered” group. The origi- 
nal and rotated factors are presented in Table 
3, where “recovered” and “nonrecovered” pa- 
tients are separately grouped. With but one 
exception (patient 7), all of the “recovered” 
patients have positive loadings on the rotated 
Factor I. Of the 19 “nonrecovered” patients, 
10 have negative loadings on Factor I, and 9 
have positive loadings. The difference between 
Factor I loadings of the “recovered” and 
“nonrecovered” patients is significant at the 
.05 level by White’s rank-order test (T = 122 
nı = 11, ng = 19). (This finding must be con- 
servatively interpreted because of the reserva- 
tions noted earlier concerning the application 
of any test of significance to these data.) 


ington 25, D. C., remitting in advance $1.25 for micro- 
film and $1.25 for photocopies. Make checks payable 
to Chief, Photoduplication Service, Library of Congress. 
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No relationship was found between re- 
covery-nonrecovery and psychosurgery. A 
chi-square test of the independence of these 
two principles of classification for these pa- 
tients yielded a value of 2.302 with one degree 
of freedom, P lying between .20 and .10. 

Factor I was found to be unrelated to age 
(rho = —.11, not significantly different from 
zero). This factor was also found to be unre- 
lated to sex by White’s rank test (T' = 125.5, 
m = 9, m = 21, not significant). Similarly, 
the criterion of “recovery” is unrelated either 
to age (a chi-square test of the independence 
of the criterion and age categories of 20-34 


years, 35-49 years, and 50-64 years, yielded a 
value of 1.176 with 2 df, P falling between P| 
and .5), or sex (chi-square of 437 with 1 df, P 
between .7 and .5). Neither the criterion nor 
Factor I is related to duration of illness at time 
of testing. No relation was anticipated since 
these patients were selected as having chronic 
(or long duration) psychoses. 

Table 4 lists the 20 measures which contrib- 
uted most to a definition of each factor. The 
measures are listed in a decreasing order of 
importance (as defined above in the Procedure 
section) as contributors to the factor. The 
minus signs in front of some test code numbers 


TABLE 4 
PSYCHOLOGICAL MEASURES WHICH CONTRIBUTE MOST TO A DEFINITION OF Eacu FACTOR 


FACTOR I 


FACTOR II 


—42 Essential Differences, Correct 
—48 Benton Visual Retention, Correct 
—31 WB Block Design 
—24 WB Comprehension 
54 Addition Time 
—30 WB Picture Completion 
—27 WB Similarities 
—38 Complex Reaction Time, Correct 
—34 WB Full IQ 
37 Porteous Q score 
—23 WB Information 
22 Complex Reaction Time, Confusion 
—29 WB Picture Arrangement 
—59 Time Interval, Operative Est. 
—60 Time Interval, Silent 
—61 Time Interval, Sound 
—25 WB Digit Span 
—28 WB Vocabulary 
19 Social-Ethical 


53 Subtraction Time 


—5 Word Association, Incorrect 
—53 Subtraction Time 
23, WB Information 
28 WB Vocabulary 
47 Capps Homographs 
49 PI Sorting, Recall 
68 Verbal Directions, Rate of Learning 
—19 Social-Ethical 
45 PI Sorting, Single items 
3 Word Association, Correct 
—8 Rorschach, Detail 
—10 Rorschach, Movement 
—1 Word Association, Reaction Time 
46 Weigl Performance 
—58 Time Interval, Verbal 
—59 Time Interval, Operative 
—60 Time Interval, Silent 
—64 Meaningful Paired Associates, Trials to learn 
—65 Semimeaningful Paired Associates, Trials to 
learn 
27 WB Similarities 


FACTOR TI 


FACTOR IV 


—15 Levy Movement Cards 
6 Rorschach, Total Responses 
20 Mirrow Drawing, Blocking 
23 WB Information 
29 WB Picture Arrangement 
53 Subtraction Time 
65 ere Paired Associates, Trials to 
earn 
—2 Word Association, Reaction Time 
—19 Social-Ethical 
—37 Porteous Q 
—39 Complex Reaction Time, Errors 
43 PI Sorting, Number Grouped 
48 Benton Visual Retention, Correct 
57 Time Interval Reproduction 
62 Mirror Drawing, Total sides 
—22 Complex Reaction Time, Confusion 
8 Rorschach Detail 
63 Mirror Drawing, Error 
_44 PI Sorting, Verbalized 
_45 PI Sorting, Single Items 


—21 Mirrow Drawing, Frustration rs 
—67 Paired Associates, Semimeaningful Variability 
—16 Anxiety A 

—17 Anxiety B 

—22 Complex Reaction Time, Confusion 

—37 Porteous Q Score 

—47 Capps Homographs 


—68 Verbal Directions, Rate of Learning 
29 WB Picture Arrangement 
40 Albert Test 
20 Mirror Drawing, Blocking 
—50 Memory for objects, Recall 
—39 Complex Reaction Time, Errors 
9 Rorschach, Form 
34 WB Full IQ 
2 Word Association, Reaction Time 
4 Word Association, Correct 
33 WB Digit Symbol 
31 WB Block Design 
—52 Memory for Objects, Recognition 


& 
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indicate that below-median performance on 
those tests contributed to a definition of the 
positive end of that factor. For tests without 
minus signs, above-median performance con- 
tributed to a definition of the positive end of 
that factor. 

The relation of Factor I to the recovery 
criterion shows that patients who eventually 

recover” almost always have high loadings 
on this factor, although patients with high 
loadings on this factor do not invariably re- 
cover. For this reason an additional comparison 
way carried out. The performance of the five 

recovered” patients with highest loadings on 
Factor I (patients 2, 4, 5, 11 and 12) was com- 
pared with the performance of five “nonre- 
covered” patients with highest loadings on 
that factor (patients 8, 17, 25, 36 and 42), using 
a procedure similar to that which was applied 
in obtaining descriptions of the factors in terms 
of the tests, except that the performances were 
not weighted by the factor loadings. It was 
found that the recovered patients more often 
than the nonrecovered show superior perform- 
ance on Seeing Persons and Seeing Movement 
in the Levy movement cards, the Albert test, 
Rorschach Movement, the Picture Arrange- 
ment, Object Assembly, and Digit-Symbol 
Substitution subtests of the Wechsler-Bellevue, 
and the PI Sorting test. The recovered pa- 
tients show inferior performance on Word 
Association Reaction Time, Time Interval 
Reproduction, and Rorschach Reaction Time. 
Nonrecovered patients, on the other hand, are 
superior to the recovered patients on Verbal 
Directions, WB Vocabulary, Porteous IQ and 
MA, the Analogies Test, the Weigl test, the 
Capps Homograph test, and Verbalized Group- 
ings on the PI Sorting test. The nonrecovered 
group required fewer trials to learn Semi- 
meaningful Paired Associates, but had greater 
“confusion” scores on Complex Reaction Time. 


Discussion 
th all of the intelli- 


Factor I is associated wi n 
“recovery” as- 


gence tests. In its positive oF 
pect, it is associated with poor performance on 
these tests as well as with long reaction times 
and poor time judgment. Factors II, MI, and 
IV are also associated with “intelligence” 
measures, and the components of intelligence 
involved in these factors are more easily iden- 
tified than in Factor I. Therefore, 4 practical 
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approach to a description of Factor I may be 
first to describe what it is zot in terms of Fac- 
tors II, III, and IV. 

Factor II in its positive aspects appears to 
represent good memory and concept formation. 
It is clearly a verbal factor as indicated by the 
high relative importance of the General Infor- 
mation, Vocabulary, and Similarities subtests 
of the Wechsler-Bellevue in its positive as- 
pects, and Word Association Inaccurate Re- 
production on its negative side. It also repre- 
sents concept formation at the verbal level as 
indicated by the high relative importance of 
the PI Sorting-Recall and Isolation of Single 
Items, and Weigl Test Performance. Some 
verbal flexibility is indicated by the Capps 
Homograph Score. Factor II may be thus 
described as containing various verbal com- 
ponents of intelligence. 

Factors III and IV both appear to involve 
perceptual abilities. Factor III is associated 
with Rorschach Total Responses and Detail, 
Mirror Drawing scores (scores 20, 62, and 63), 
Picture Arrangement subtest of the Wechsler- 
Bellevue, the Benton Visual Retention Test, 
and Seeing Movement in the Levy Movement 
cards. Patients who have high loadings on this 
factor seem capable of good performance on 
the PI Sorting tests, but are unable to verbal- 
ize their groupings. Factor IV is associated 
with the Picture Arrangement, Block Design, 
and Digit Symbol subtests of the Wechsler- 
Bellevue and Rorschach Form responses. Cer- 
tain verbal tests contribute negatively to this 
factor, such as the Capps Homograph test and 
the Wechsler-Bellevue Similarities and Vocab- 
ularly. Also contributing negatively are Mirror 
Drawing Frustration and both Recall and 
Recognition Memory for Objects. Both Factors 
III and IV, then, seem to contrast good “‘per- 
ceptual” performances with poor “verbal” per- 
formances. The first seems to deal with the 
perception of concrete and familiar objects, 
whereas the latter is concerned with the per- 
ception of somewhat more abstract designs. 

Factor I, then, cannot represent the verbal 
and perceptual components of test perform- 
ance. The positive end of Factor I, which is 
associated with “recovery,” represents low 
scores on all of the various tests of “intelli- 
gence.” However, there is reason to believe 
that poor performance results from the failure 
of these patients to respond to the test situa- 
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tion. Confusion and sluggishness is indicated 
by the incorrect estimation of time intervals, 
and the various long reaction time scores. Poor 
test performance, probably brought about by 
failure to respond to the test situation, seems 
then to be a necessary condition for recovery 
from chronic psychosis. 

Among the patients who had high loadings 
on Factor I, the “recovered” group is distin- 
guished from the “nonrecovered” group in 
that the “recovered” patients seem better able 
to perform well on “perceptual” tasks (Levy 
Movement cards, Albert test, Rorschach 
Movement, WB Picture Arrangement, Object 
Assembly and Digit Symbol, and the PI Sort- 
ing test) which require some learning in the 
immediate situation. The nonrecovered pa- 
tients are apparently better able to make use 
of previous learning, particularly of a verbal- 
conceptual nature (Verbal Directions, Vocab- 
ulary, Porteous IQ and MA, Analogies, and 
Homographs). These findings lend some sup- 
port to the hypothesis that prognosis is favor- 
able when “perceptual” scores exceed ‘‘con- 
ceptual” scores (8, 14, 15). However, this 
hypothesis concerns intrapatient comparisons 
and the above findings are based on inter- 
patient comparisons. Also, because they are 
based on comparisons among only ten patients, 
they must be interpreted with some caution. 

Thus, the conditions favorable for a progno- 
sis of recovery from chronic psychosis appear 
to include (a) generally poor performance on 
tests of intellectual abilities, (b) confusion and 
sluggishness as evidenced in long reaction 
times and poor time estimation, and (c) some- 
what better performance in perceptual tasks 
requiring learning in the immediate situation 
over performance on verbal-conceptual tasks 
which depend on prior learning. 


SUMMARY 


The purpose of this study was (a) to deter- 
mine if, among chronic psychotics, there exists 
a “psychometric type” with a higher than 
chance probability of recovery, and (6) if so, 
to describe that type. 

The subjects were 30 chronic psychotics of 
the Columbia-Greystone study, 11 of whom 
were classified as “recovered” as of May 5, 


1954. i 
A matrix of the interperson correlations 


among all subjects based on above-median 
scores on 66 psychological test measures was 
factor analysed by Thurstone’s complete cen- 
troid method. Four factors were extracted. 
These were rotated to maximize the projections 
of the “recovered” group on one factor while 
minimizing their projections on all other fac- 
tors. The conclusions drawn from the analysis 
are as follows: 

1. Among chronic psychotics there exists a 
“psychometric type” with a better-than- 
chance probability of recovery. This type is 
not distinguished by age, sex, or duration of 
illness. 

2. The type is characterized by generally 
poor performance on intellectual tasks, long 
reaction times, and poor time estimation. 

3. Among patients of this type, those who 
recover perform better than those who do not 
on perceptual tasks requiring new learning, 
whereas those who do not recover perform 
better than those who do on verbal-conceptual 
tasks depending on previous learning. 


REFERENCES 


1. Barsky, B. Factors in the vocational adjustment 
of schizophrenics after mental hospital dis- 
charge. J. clin. Psychol., 1947, 3, 341-349. 

2. Carrett, R. B. Factor analysis; an introduction 
and manual for the psychologist and social scientist. 
New York: Harper, 1952. 

3. CHASE, L. S., & SILVERMAN, S. Prognostic criteria 
in schizophrenia; a critical survey of the litera- 
ture. Amer. J. Psychiat., 1941, 98, 360-368. 

4. CHESIRE, L. S., Sarrir, M., & THURSTONE, Ea i 
Computing diagrams for the tetrachoric correlation 
coefficient. Chicago: The Univer. of Chicago 
Bookstore, 1933. 

5. Lourir, R. S., PACELLA, B. L., & PIOTROWSKI, Z. 
A. Studies on the prognosis in schizophrenic-like 
psychoses in children. Amer. J. Psychiat., 1943, 
99, 542-551. 

6. METTLER, F. A. (Ed.) Selective partial ablation of the 

frontal cortex. New York: Hoeber, 1949. 

. Pascat, G. R., Swensen, C. H., FELDMAN, D. A» 
Corr, M. E., & BAYARD, J. Prognostic criteria 10 
the case histories of hospitalized patients. 
consult. Psychol., 1953, 17, 163-171. 

8. Prerrer, M. Pattern of abilities in chronic schizo- 
phrenics. Unpublished doctor’s dissertation, 
Columbia Univer., 1952. 

9. SCHERER, I. W. Prognoses and psychological scores 
in electroconvulsive therapy, psychosurgery, an 
spontaneous remission. A mer. J. Psychiat., 1951, 
107, 926-931. 

10. Wurte, C. The use of ranks in a test of significance 

for comparing two treatments. Biometrics, 1950, 


8, 33-41. 


x 


ANALYSIS OF INTERPERSON CORRELATIONS AMONG THIRTY PsycHorics 57 


11. WINDIE, C. Psychological tests in psychopathologi- 14. Zusry, J., & Wrvte C. Psychological prognosis of 


cal prognosis. Psychol. Bull., 1952, 49, 451-482. outcome in the mental disorders. J. abnorm. soc- 
12. Winpte, C., & Hamwi, V. An exploratory study Psychol., 1954, 49, 272-281. : 

of the prognostic value of the Complex Reaction 15. ZUBIN, J., WINDLE, C., & Hamwı, V. Retrospective 

Time Test in acute and chronic psychotics. J. evaluation of psychological tests as prognostic 

clin. Psychol., 1953, 9, 156-161. instruments in mental disorders. J. Pers., 1953, 
13. Zubin, J. Evaluation of therapeutic outcome in 21, 342-355. 

mental disorders. J. nerv. ment. Dis., 1953, 117, 

95-111. Received August 13, 1956. 


FREQUENCY AND TDIOSYNCRACY OF ASSOCIATIVE RESPONSES' 


GEORGE MANDLER anp ELIZABETH W. PARNES 
Harvard University 


NDIVIDUAL differences in response to envi- 
ronmental stimuli can be described in 
terms of the frequency and the normative 

aspects of the responses elicited by those stim- 
uli. Thus, the response to or perception of a 
stimulus is determined by the particular array 
of responses which that stimulus elicits from 
the subject (S). The responses which thus dif- 
ferentiate the stimulus from other stimuli rep- 
resent not only its immediate “meaning,” but 
can be conceptualized as the result of the S’s 
previous learning experiences with such stim- 
uli (4). It has also been argued that this ac- 
quisition of differentiating responses forms the 
basic paradigm for perceptual learning (5). 

Empirically, it has been shown that the 
frequency of responses elicited by verbal stim- 
uli is related to familiarity with the stimulus 
(10) and to speed of serial and paired-associate 
learning (2, 7, 9). Noble (8) and Mandler (4, 
6) have suggested a relationship between the 
associative frequency of a stimulus and its 
meaningfulness, theoretically defined or em- 
pirically determined. Finally, it has been shown 
that individual rates of learning are related to 
the subject’s frequency of emitted responses 
(7). 

This paper reports two studies of the con- 
sistency of individual differences in frequency 
of response to different stimuli and the effect 
of stimulus variation upon response emission. 
Five general hypotheses are examined: 

1. There are significant differences among 
Ss in both frequency and idiosyncratic content 
of associative responses. 

2. The Ss show consistent tendencies in the 
number of responses evoked by a variety of 
different stimuli. 

3, There are consistent tendencies in the 
content of the responses evoked by different 
stimuli, defined as the relative frequency of 
idiosyncratic or individual-specific responses. 

4, Stimulus classes differ from each other in 
the elicitation of responses in respect to both 


frequency and to content. 


i This study was supported by grants from the 
Laboratory of Social Relations, Harvard University, 
and from the Foundations Fund for Research in Psy- 


chiatry. 
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5. A failure-stress situation produces meas- 
urable differences in individual response out- 
put and idiosyncratic content. 

It is patently impossible to elicit and to 
quantify free association material in a con- 
trolled laboratory situation. However, this 
study represents an initial attempt to evoke 
controlled associations to a variety of different 
materials. The term “controlled” rather than 
“free” is used since the S is instructed to give 
those responses which are directly elicited by 
the stimulus. In this manner, a first approxi- 
mation to individual differences in associative 
material may be obtained. The materials used 
in this preliminary study were selected to rep- 
resent a diversified range of visually presented 
materials. Both verbal and nonverbal stimuli 
were included in order to ascertain any effect 
which might be directly related to response 
differences directly attributable to the verbal 
nature of the stimulus materials. In addition, 
it was decided to include some stimuli which 
are part of the clinical armamentarium in or- 
der to ascertain any differences between fairly 
simple stimulus materials and those classically 
believed to represent the best conditions for 
the elicitation of personally meaningful ma- 
terial. 

In line with these considerations, four classes 
of stimulus materials were selected. The two 
types of verbal stimuli consisted of nonsense 
syllables and adjectives. It has experimentally 
been shown that differences in the associative 
frequency of nonsense syllables are empirically 
useful, while adjectives were selected as repre 
senting verbal materials more closely relate 
to the S’s everyday learning experiences. The 
nonverbal stimuli consisted of line drawings 
and Rorschach cards. The line drawings, rang- 
ing from simple objects to nonsense figures, 
again tap a more familiar array of stimuli, 
while the Rorschach cards represent a clinically 
useful and relatively unstructured stimulus 
situation. 

In order to assess further the usefulness of 
this technique in assessing personality vari- 
ables, it was decided to introduce a simple yet 
widely used experimental procedure. Failure 
stress was selected for this part of the stu 
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because it has been shown to be useful in the 
elicitation of personality differences (3). 


METHOD 


The first of two studies (Study I) investigated the 
responses of 20 Ss to 40 different stimuli, ten in each of 
four stimulus classes: Nonsense Syllables, Adjectives, 
Drawings, and Rorschach cards. In Study II, two 
groups of 12 Ss each were used. The control group 
responded to 24 stimuli (six from each of the four 
stimulus classes); the experimental group responded to 
12 stimuli, were then given failure-stress instructions, 
then responded to the remaining 12 stimuli. 


Materials 


In Study I, the stimuli were selected from each of 
the four stimulus classes in order to obtain a maximum 
range of differentiation in meaningfulness and structure 
on the basis of available information.* 

Nonsense syllables were selected from the list of 100 
published by Mandler (6). This standardization pro- 
vides an index of associative frequency which describes 
the mean number of associations elicited during a 30- 
sec. period. These 100 syllables were divided into ten 
groups of 10 syllables each on the basis of an index of 
ambiguity. This index is derived from the difference 
between the mean number of different responses elicited 
by a stimulus and the mean number of total responses 
elicited. Ambiguity increases as this index approaches 
zero. An index of zero would be obtained if no response 
to a particular stimulus is given by more than one S. 
One syllable was chosen at random from each of the ten 
ambiguity groups. 

Nine adjectives were selected from each of nine 
different frequency categories from the Thorndike- 
Lorge word list. They ranged from more than 100 per 
one million words to 4 per 18 million words. The tenth 
adjective was selected from an unabridged dictionary 
as one appearing both unusual and unfamiliar to the 


experimenters. 


‘All 10 drawings were selected on the basis of unpub- 


lished data collected by Baum! from the pictures used 
by Heidbreder in her concept formation studies (1). 
The range of drawings was from simple realistic objects 
to abstract complex figures, and selection was based on 


the number of different single responses elicited by the 
sequent trials. The selected drawings 


to 86 per cent in the percentage 
licited from a group of Ss, and 
t in the percentage of Ss 


stimuli on two sub: 
ranged from 11 per cent 
of different responses € 
from 11 per cent to 73 per cen 


e been deposited with 


2 The following materials hav e 
Institute: Lists of 


the American Documentation . 0 
stimulus materials used; the initial correlation matrix 


for the factor analysis; tables of response frequency and 
idiosyncratic percentage for the four stimulus classes 
rder Document No. 5198 


for the 20 Ss in Study I. O k 
from ADI Auxiliary Publications Project, Photodupli- 
cation Service, Library of Congress, Washington 25, 
D. C., remitting $1.25 for 35 mm. microfilm or $1.25 
for 6 x 8 in. photocopies. Make checks payable to Chief, 
Photoduplication Service, Library of Congress. , 

3 Thanks are due Dr. Marian H. Baum, of Oberlin, 


Ohio, for making these data available. 


who changed the response from trial one to trial two, 
thus covering a wide range of response variability. 
The ten standard Rorschach cards were also used. 
In Study II, only six stimuli were used in each of the 
four stimulus classes. The selection procedure was iden- 
tical for the four classes. The stimuli within each class 
were ranked on the basis of the number of responses 
elicited in Study I, and ranks Nos. 1, 2, 5, 6, 9, and 10 
were selected. In the case of tied ranks, the stimulus 
with the greatest number of popular responses (defined 
below) was chosen. The range obtained in respect to 
ambiguity and frequency was highly similar to that in 
Study I for the Syllables, Adjectives, and Drawings. 
This procedure resulted in the selection of Rorschach 
cards Nos. 3, 5, 7, 1, 6, and 8 in theorderof rankschosen. 


Subjects 


The Ss in Study I were 19 male and one female 
undergraduate students recruited through the Uni- 
versity employment bureau. In Study II, the Ss were 
12 male and 12 female summer school students. The 
control and experimental groups consisted of six male 
and six female Ss each, assigned at random. 


Procedure 

Instructions. The Ss in both studies were initially 
informed that they were to take part in an experiment 
designed to ascertain “what kind of responses people 
give to certain stimuli.” They were instructed to re- 
spond to each stimulus with all the “words that the 
stimulus brings to mind.” Ss were asked not to give 
chain-associations, but rather to be sure to give re- 
sponses directly elicited by the stimulus. They were 
also requested not to give phrases or sentences, but to 
restrict themselves to single words. 

In Study II, the control group proceeded after the 
initial instruction without any further interruptions. 
The experimental group was interrupted after the 12th 
stimulus presentation. At that time the experimenter 
(E) informed them that they were being interrupted 
because they were doing so poorly and that they either 
showed low ability or were “not trying at all.” They 
were asked to buckle down on the rest of the stimuli. No 
other information was given them. They then proceeded 
with the second half of the stimulus presentation. 

Stimulus presentation and assignment. The stimulus 
materials were presented by means of a modified juke 
box mechanism, which allowed the Æ to select the 
serial presentation in a predetermined order. Stimuli 
were mounted on 10-in. discs and presented to the S 
through a mirror arrangement. All stimuli were pre- 
sented for 30 sec. with an interval between stimuli of 
approximately 10 sec. 

In Study I, each S received a different order of 
stimulus presentations. The order of the 40 stimuli 
was random except that each block of four successive 
stimuli contained one item from each of the four 
stimulus classes, and no two items of the same stimulus 
class followed each other. 

In Study II, twelve such series were prepared. Thus, 
the 12 control Ss and the 12 Ss in the experimental 
group received identical serial presentations, with the 
additional provision that Ss receiving the same stimulus 


order were of the same sex. 
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Dependent Variables 


The responses of all Ss were recorded and tran- 
scribed. For purposes of analysis, a response was defined 
in the following way: 

1. Comments to the E, exclamations, connectives, 
and other obviously irrelevant responses were excluded. 

2. When the S responded with a phrase (contrary to 
the instructions), the response was categorized by the 

main noun, if any; otherwise, by the main verb. The 
main word was defined as appearing in the major 
clause of a sentence. 

3. For counting replications of a response by Sona 
particular stimulus, a response was tallied only once if 
the replications represented plural or singular forms of 
the same noun, different forms of the same verb, or 
verbs and adjectives with identical stems. Thus, a 
single response is defined as a single word, complex 
word, or phrase. For coding purposes, responses were 
reduced to single words. 

These responses were tabulated separately for each 
stimulus. In order to establish frequency distributions 
of responses given to a particular stimulus by all Ss, 
responses were combined, i.e., counted as replications, 
in a manner similar to that used for replications by Ss. 

In Study I, the following measures were obtained 
from these tabulations for each S: 

F—the number of responses given. 

I—the number of idiosyncratic responses given, 
defined as those responses which are given to a 
particular stimulus by one, and only one, S. 

I%—the percentage of idiosyncratic responses 
out of the total number of responses. 

P—the number of popular responses given, de- 
fined as those responses which are given to a partic- 
ular stimulus by 40 per cent or more of the Ss. 

P%—the percentage of popular responses out of 
the total number of responses. 

Pp’—the number of popular responses given as 
initial responses to the stimuli. 

T—total popularity measure of responses. T for 
any one S is defined as the sum of the index of 
popularity of all responses. This index is the number 
of Ss giving that response. Thus, the T value for an 
idiosyncratic response is one, of a response given 
(to a particular stimulus) by 12 Ss is 12. 

T/F—mean popularity index. 

X%—percentage of responses which are neither 
popular nor idiosyncratic. 

‘A—index of popularity (see T) of all initial 
responses given by S. 

M—the number of specific responses which are 
given by S to more than one stimulus. 

M’—M times the number of times it is given 


by S. : 
M'%—the percentage of M’ responses. 
In Study II, only the F and I% measures were used. 


RESULTS 


The analysis of the data proceeded in three 
general steps. The first consisted of a factor 
analysis of the response measures listed in the 
Method section. On the basis of this analysis, 
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the two variables of response frequency and 
percentage of idiosyncratic responses were 
selected for more detailed testing of the five 
experimental hypotheses. 


Factor Analysis‘ 


The 13 variables listed above were used to 
obtain a correlation matrix based on the scores 
of the 20 Ss used in Study I.® For all measures, 
the scores computed for each S were obtained 
from all the 40 stimuli used for all Ss. Two 
factors were obtained, and one orthogonal ro- 
tation resulted in the factor loadings shown in 
Table 1. 

The two factors can be relatively easily 
identified. Factor A shows the highest loadings 
for number of popular responses (P), number 
of popular responses given initially (P’), pop- 
ularity of all initial responses (A), the mean 
index of popularity of all responses (T/F), and 
a high negative loading in the percentage of 
idiosyncratic responses (1%). This cluster of 
variables leads to the designation of the first 
factor as a popularity dimension, i.e., the tend- 
ency to give common responses or to give a 
low percentage of idiosyncratic responses. Fac- 
tor B shows the highest loadings in the fre- 
quency of response (F), total number of 
idiosyncratic responses (I), and the tendency 
to repeat identical responses to different stim- 
uli (M and M’). All of these variables can be 
conceptually related to Ss’ general frequency 
of response output, and the factor can be des- 
ignated as a frequency dimension. On the basis 
of these two factors, one variable was selected 
from each of the two for further analysis. Fre- 
quency of response (F) was selected as repre- 
sentative of Factor B and percentage of 
idiosyncratic response (1%) as representative 
of Factor A. The selection of 1% was based 
not only on the high negative loading this 
variable showed on Factor A, but also because 
it provided a more convenient scoring syste 
for Study II than the other popularity meas- 
ures. 


Response Frequency 
The individual difference variable showed 2 
high degree of subject-to-subject variability 1 
both studies. In Study I, the mean number © 
‘Dr. Ray Hyman’s assistance in this analysis $5 


gratefully acknowledged. 
5 See Footnote 2. 
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TABLE 1 


ROTATED Factor LOADINGS FOR ALL 
VARIABLES IN STUDY I 


Factor Loadings 


Variable 
Factor A Factor B 

: +331 +.89 
= —.11 +.91 
1% —.75 + .44 
P +86 die 
P% +.60 —.66 
P +.86 +.20 
T + .66 +.65 
T/T +.78 —.55 
X% +.70 —.07 
A +.86 +.12 
M, +.14 +.96 
M’ +.13 +.97 
M'% —.05 +169 


responses elicited by a stimulus in a 30-sec. 
period was 6.9.6 Subject means ranged from 
2.6 to 16.0 with a standard deviation of 3.63. 
In Study II, the mean was 6.0, range 2.5 to 
15.1, standard deviation 3.52. Thus, this vari- 
able providesa sensitive differentiation between 
individuals, with the most responsive Ss pro- 
ducing as much as six times as many responses 
on the average than the least responsive Ss. 

Analyses of variance, summarized in Table 
2, were performed on the frequency data from 
both studies. In both studies, the largest con- 
tribution to variance was from differences be- 
tween Ss. Stimulus classes and stimulus differ- 
ences within classes (in Study I), however, 
were highly significant sources of variation. 
Additional analysis of Study I revealed that 
the four stimulus classes chosen showed greater 
variability between classes than within classes 
(F = 6.47, p < -005). Similarly, differences 
between Ss obviously contributed more to var- 
iance than differences between the stimuli 
used. Finally, the interaction between indi- 
viduals and stimulus classes in Study I was 
significantly greater than the interaction be- 
tween individuals and stimuli within classes. 
This finding indicates that not only do Ss react 
differently to different kinds of stimuli, but 
that this differentiation is greater than the 
differential reaction to stimuli within a par- 
ticular class. 

The absolute differences in frequency of 
responses elicited by different stimulus classes 
were not, however, very great. Table 3 shows 


ê See Footnote 2. 


the means of the various classes, and the ranges 
within these classes in Study I. While the 
means show relatively small differences be- 
tween stimulus classes, the full range for the 
stimuli from 4.8 (for the nonsense syllable 
“Duj”) to 8.8 (for the adjective “Cordial” and 
the drawing of a house) suggests substantial 
differences between stimuli in their power to 
elicit a greater or lesser frequency of response. 
A surprising finding was the relatively small 
range of differentiation of Rorschach cards 
when compared with the other stimulus 
classes. 

In Study II, where stress was introduced, all 
Ss’ responses to stimuli within a particular 
class were used as the unit of response. In 
addition to the major effects found already in 
Study I, a significant source of variance was 
associated with the first and second halves of 
the experiment. This effect must be viewed in 
the light of the significant Halves X Condi- 
tions interaction. It will be recalled that all Ss 
were presented with the same stimulus condi- 
tions controlled for serial presentation. The 
experimental group, however, was subjected to 
failure stress after the first half of the stimuli 
had been presented. Thus, any effect due to 
the introduction of this variable should show 
up in the Halves X Conditions interaction. 
This significant source of variance can be 
attributed to a significantly greater increase 
in response frequency in the experimental 
group. While the control group showed a mean 
increase of 13 per cent, the experimental group 
showed an increase of 30 per cent. One of the 
effects of stress, therefore, was to increase the 
S’s rate of responding well beyond a general 
increase in responsivity from the first to the 
second half of the experimental situation. The 
only other significant finding was the quad- 
ruple interaction between Stimulus Classes, 
Sex of Ss, Conditions, and Halves (p < 05). 
Without prior predictions it is difficult to in- 
terpret such a complex interaction effect. 

The final question to be raised concerning 
the frequency variable concerns the consist- 
ency of response frequency across different 
stimulus classes. Table 4 shows a measure, the 
index of concordance (W), which indicates this 
consistency tendency. Ws were computed for 
all Ss in Study I, for the control group in 
Study II, and separately for the pre- and post- 
experimental stimuli in the experimental 
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TABLE 2 
SUMMARY OF ANALYSES OF VARIANCE OF RESPONSE FREQUENCY IN STUDIES I AND II 
Study I Study II 
Source Mean A Mean 5 
df Square F p af Square r p 
Total 799 17.1 = = 191 122.0 — 
Between Ss 19 533.1 48.86 <.001 20 789.3 22.41 <.001 
Stimulus Classes 3 63.2 5.79 <.005 3 300.3 18.37 <.001 
Stimuli within Classes 36 9.8 3.69 <.001 — — — 
Ss X Stimulus Classes 57 10.9 4.11 <.001 _ — — = 
Halves* the = = 7 = 1 740.0 21.39 <.001 
Halves X Conditionst = = = = 1 153.0 4.42 <.05 
* Halves” refers to the first and second half of Study II. 
t “Conditions” refers to the experimental vs. control groups in Study II. 
TABLE 4 


TABLE 3 
MEAN AND RANGE OF RESPONSE Frequency (F) 
ELicireED BY Four Stmrurus CLASSES 


Consistency OF INDIVIDUAL DIFFERENCES Across 
Four STIMULUS CLASSES 


Stimulus Class Mean Range F* I%t 
‘aaa A A s z5 ETA Source of Data — 
onsense syllables $ 8-7. 
aaee Lel 6.1-8.8 wE w 
i = 5-8. ws Hi 
re ee cand se ee Study T (20 Ss, 40 stimuli) |.900|<.01).672|<.01 
udy 
Control (12 Ss, 24 stimuli) _|.872|<.01 691) < .01 
P z 7 Pre-experimental (12 Ss, 12 
group. It is apparent that in all cases the index stimuli) .943| < .01|.315| -30 
was very high, ranging from . 72 to .943. i Postexperimental (12 Ss, 12 
ry ga, ranging 872 to .943. This stimuli) 882| < .01|.644|< -01 


finding indicates that Ss who gave a high fre- 
quency of responses to any one stimulus class 
also tended to give a high frequency of re- 
sponses to any other stimulus class. This tend- 
ency did not seem to be affected by the intro- 
duction of the experimental stress variable. 


Idiosyncratic Responses 

It will be recalled that the percentage of S’s 
hich that S, and no other S, gave 
lar stimulus was chosen to repre- 
ularity factor derived from the 
factor analytic investigation. In Study I, the 
relationship between F and I% was found to 
show a nonsignificant rank-order correlation of 
+.19. In Study H, however, this relationship, 
using all 24 Ss on the first half of the stimuli, 
was +.72 = < 01). This discrepancy led 
to a further examination of the results of 
Study I. It was discovered that the low posi- 
tive correlation could be attributed largely to 
the behavior of two discrepant Ss. These indi- 
viduals showed the two lowest ranks on fre- 
quency of response and the two highest ranks 
f idiosyncratic responses. Using 


on percentage o 
only the remaining 18 Ss, the rank-order cor- 
relation between the two measures increased 


responses W 
to a particu! 
sent the pop’ 


* F = Response Frequency. 
t 1% = Percentage of Idiosyncratic Responses. 


to +.63 (p — < .01). A further investigation 
of these two Ss lent support to the hypothesis 
that this negative correlation between fre- 
quency and percentage of idiosyncratic re- 
sponses may be diagnostic of deviant adjust- 
ment. The following two quotations from the 
E’s postexperimental notes illustrate this 
point: 

Subject A: “.. 
swer hadn’t been too ‘vague’ . 
lary is broad but he sometimes 
remember what a word meant . 
to know more about psychology, 


self and normality...” 
Subject B: “...a bit paranoid, at least 


suspicious and unsure of self... might be 
witty, clever in a hostile way. . , probably 
deep-seated problems and well entrenche 
defenses...” F 
These impressionistic comments tend to bea" 
out the supposition of some pathological gee 
esses separating these Ss from the rest of 1 
sample. As far as their response behavior Y 


. he said he hoped his an- 
. . his vocabu- 
couldn’t 
.. he wants 
about him- 
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TABLE 5 


PERCENTAGE IDIOSYNCRATIC RESPONSES (1%) FOR 
THE Four STIMULUS CLASSES 


Total I ra 
Stimulus Class ise. ae S CAA 
Study I [Study 11| Study I 
Nonsense syllables 54% 58% | 22%-91 
Adjectives 336, | 34% | 26% EA 
preio 51% 55% | 25%-83% 
orschach cards 62% 57% | 37%-83% 


concerned, they were very inexpressive indi- 
viduals, apparently tending to inhibit many 
of their responses. The resultant behavior was 
highly idiosyncratic; both Ss gave 76 per cent 
idiosyncratic responses. Comparable individu- 
als either gave a large number of responses 
with a high percentage of idiosyncratic ones, 


or else gave few responses, most of which 


_ idiosyncratic responses was surpri 


-appears that the e 


would tend to be given also by other Ss. 
The range of 1% showed large variability. 
In Study I, for all 20 Ss, 1% varied from 30 


per cent to 76 per cent.” The percentage of 

isingly stable 
from stimulus class to stimulus class, as shown 
in Table 5. 


Once again there were large individual differ- 


ences in this variable, and relatively small 
differences between stimulus classes. As in the 
frequency variable, it is surprising that the 
Rorschach stimuli did not appear qualitatively 
distinct from the other stimulus classes. They 
elicited somewhat more idiosyncratic material, 
but only by a few percentage points. 

The effect of the experimental conditions on 
the relative frequency of idiosyncratic re- 
sponses was similar to the effect on response 
frequency. An analysis of covariance of 1% 
scores of the control and experimental groups 
showed a greater increase in the experimental 
group at the 05 level of significance. When 
total numbers of idiosyncratic responses were 
examined, it appeared that the general increase 
in frequency from the first to the second half 
differed for the two groups: In the control 
group, only one per cent of the response incre- 
ment in the second half of the experiment con- 
sisted of idiosyncratic responses, whereas in the 
experimental group, 91 per cent of the addi- 


tional responses were idiosyncratic. Thus, it 
s was to 


ffect of failure stres: 


7 See Footnote 2. 
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REPRESENTS THE TOTAL I% ELiciTED BY ALL 


THE INITIAL, SECOND, THIRD, ETC., RESPONSES 
GIVEN By ALL Ss 


PERCENT IDIOSYNCRATIC RESPONSES 


increase the productivity of idiosyncratic ma- 
terial. 

Table 4 shows the index of concordance for 
I% in the various groups. Once again, though 
not as marked as for the frequency variable, 
there was a high degree of consistency from 
stimulus class to stimulus class. The only ex- 
ception was the pre-experimental group, where 
consistency was not significant. These same Ss, 
however, seemed to achieve a significant de- 
gree of consistency following the failure stress. 
The nonsignificant finding prior to the stress 
may be a function of the relatively small num- 
ber of stimuli used in determining this measure. 
In any case, the increase in W as a function of 
the experimental variable seems to indicate 
that Ss show greater consistency in their post- 
stress responses or, in other words, that indi- 
vidual differences are more emphasized as a 
function of the stress. 

The data also provide evidence concerning 
the degree of idiosyncratic responding as a 
function of successive associations. Figure 1 
shows the total percentage of idiosyncratic 
responses elicited by each of the first ten asso- 
ciations given to all stimuli. Since there were 
no differences between subgroups in Study II, 
the data from all 24 Ss were combined. The 
figure shows that while the initial response to 
the stimuli was idiosyncratic in about 30 per 
cent of the cases, this percentage rose steadily 
to about the 60 per cent level by the fifth or 
sixth response given. Little increase occurred 
after that point. These data indicate that 
while Ss on the whole tend to give primarily 
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nonidiosyncratic material during their first few 
associations, their later associations tend to be 
primarily idiosyncratic. However, no increase 
in the percentage of such material is obtained 
by the addition of more responses. This sug- 
gests that in a word association task, for ex- 
ample, less than 10 responses to any one stim- 
ulus are adequate to reach a high level of 
elicitation of personal, idiosyncratic material. 


Discussion 


The findings bear out the suggestion that a 
controlled association situation can be used 
profitably in describing individual differences 
in the frequency and idiosyncracy of readily 
available associative material. The general 
hypotheses have been substantiated: 

1. Differences between Ss show a wide range 
of values and constitute a highly significant 
and consistent source of variance. 

2. Individuals show highly consistent modes 
of responding to a variety of different stimuli. 
Both the frequency of response and the idio- 
syncratic content of these responses show re- 
peatable orderings of Ss across stimulus 
classes. 

3. For the particular stimulus classes used, 
there are consistent differences in the amount 
of material these groups of stimuli elicit. While 
these stimuli were not specifically chosen to 
provide for a wide range of response variation 
between classes, they do represent a rather 
wide variety of visually presentable materials. 
The particular antecedent learning conditions 
which make for differentiations between these 
stimulus groups are of no particular concern to 
the present investigation. However, the ques- 
tion might be raised whether these differences 
are pervasive and repeatable under different 
conditions or whether the particular differ- 
ences between classes found might be restricted 
to the particular situation and mode of pres- 
entation used in these studies. 

4. A failure-stress situation produces meas- 
urable differences in response frequency and 
idiosyncratic content. It appears that the Ss 

in Study II responded to the report of failure 
and the exhortation to do better by producing 
more associations, and that these additional 
associations were primarily of an idiosyncratic 
character. It is, of course, open to question 
whether this represents a reaction to stress or 
anxiety per se, or whether it is a function of 


GEORGE MANDLER AND ELIZABETH W. PARNES 


the particular instructions used. The Ss’ im- 
mediate reaction may well have been a con- 
scious attempt specifically to produce more 
idiosyncratic responses. Taking part in a psy- 
chological experiment may well imply that one 
is expected to produce personal associations, 
while failure in a free responding situation is 
easily interpretable as a criticism of slow or 
inadequate response rate. 

It has been indicated above that the Ror- 
schach cards did not, on the basis of the meas- 
ures used, represent a qualitatively different 
set of stimuli, Only on the percentage of idio- 
syncratic responses did they show a slight 
tendency to elicit more such materials than the 
other stimulus classes. This limited finding 
need not constitute an indictment of inkblots 
as useful stimulus material. The response cate- 
gories utilized are extremely limited, and an 
extended content analysis of the material 
elicited by the different stimulus classes might 
indicate some advantage from the responses 
given to the Rorschach cards in terms of per- 
sonality theory concepts. An impressionistic 
survey of the responses, however, does not sub- 
stantiate this hypothesis at first glance. 
Rather, it appears that if the responses to the 
Rorschach cards tend to be useful in that 
sense, the S is also likely to give similar con- 
tent to the other stimuli. This does suggest 
that diagnostic devices might well include more 
mundane stimulus materials than the inkblots- 

The finding of deviant behavior in the cases 
of two Ss in Study I suggests that the present 
method might be refined for diagnostic PUT 
poses. Deviance in terms of idiosyncratic 
thought processes, such as in schizophren!4, 
might be predicted to show just such high 
idiosyncratic response frequencies couple 
with low rates of responding as was found 12 
these cases. However, such speculation must 
await further studies of controlled associations 
in particular pathological populations. 

In these studies the response variables havé 
not been related to any other personality meas” 
ures. The main reason is that this investigatio” 
was considered primarily exploratory in chat” 
acter. Some theoretical notions bath from €% 


perimental and personality studies have sug 


gested that the associations freely available t9 
a subject are theoretically and pragmatica 
useful in describing other aspects of his be 
havior. These studies have shown in a limite 
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context that these responses prove to be con- 
sistent and measurable aspects of individual 
differences. Rather than relate them to other 
personality measures, when a rationale for 
such relationships is lacking, it is suggested 
that further investigation of the antecedent 
conditions and behavioral correlates of these 
response measures would be more relevant and 
in the long run more useful. 


SUMMARY 


Two studies were conducted to test five 
general hypotheses about the frequency and 
content of verbal associations elicited during a 
30-sec. period. In the first study, 20 Ss re- 
sponded to 40 stimuli, 10 each from four stim- 
ulus classes: Nonsense Syllables, Adjectives, 
Line Drawings, and Rorschach cards. In the 
second study, the effect of a failure-stress situ- 
ation on response frequency and content was 
tested with a total of 24 Ss. 

Two variables, the frequency of response 
and the idiosyncracy of response (defined as 
the percentage of individual-specific responses) 
were selected for detailed study on the basis of 
a factor analysis of 13 response variables. The 
main findings were as follows: 

1. Differences between Ss in frequency and 
idiosyncracy showed a wide range of values. 
Individual differences in response frequency 
contributed the most significant source of 


variation. 

2. There were y 
cies in frequency and idiosyn 
across stimulus classes. 

3. There were significant differences between 
mulus classes in the frequency of responses 


highly significant consisten- 
cracy within Ss 


sti 


elicited. Variability between stimulus classes 
was significantly greater than within stimulus 


classes. 
4. The failure-stress situation tended to in- 


crease the productivity of idiosyncratic ma- 


terial. 

The results indicate the utility of “con- 
trolled” associations as an individual difference 
variable and their possible use as a diagnostic 


tool. 
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THE RELATION OF EGO CONTROL TO OVERT AGGRESSION 
AND DEPENDENCY 


NORMAN LIVSON anp PAUL H. MUSSEN! 
University of California 


ECENT research indicates that aggres- 
sion and dependency are, in large 
part, learned drives and that individ- 

ual differences in strength of these drives are 
related to early learning experiences. Thus, 
the strength of the young child’s dependency 
drive has been shown to be a consequent of 
amount of feeding frustration during infancy 
and degree of maternal nurturance and pun- 
ishment during the preschool years (10), while 
maternal discipline and the extent of the child’s 
identification with an aggressive like-sexed 
parent appear to be related to the strength of 
the child’s aggressive drive (8, 10). 

The amount of overt aggression and depend- 
ence the child manifests, however, is a function 
not only of the strength of his drives but also 
of the strength of the inhibitions against the 
expression of these drives. In the course of be- 
coming socialized, i.e., acquiring the culturally 
customary and accepted forms of behavior, the 
child learns to conform with the taboos against 
the expression of anger, sexual impulses, and 
tendencies to be dependent (5). Individual 
differences among children in strength of inhi- 
bition of these impulses have been shown to be 
related to social class membership. For ex- 
ample, in the American middle class, strong 
pressures for the inhibition of aggressive and 
sexual behavior are applied relatively early in 
the child’s life (6, 11). Independence training 
in this class is also initiated when the child is 
very young, although it is completed at a late 
age. Thus, compared with lower class children, 
those of the middle class are more likely to 
withhold overt manifestations of aggression, 

sexuality, and dependency. Knowledge of class 
differences, however, tells us little about how 
the child develops control of his drives. To 
understand this process, it is necessary to focus 
upon the individual experiences of the child. 
According to psychoanalytic theory, the 
internalization of relatively strong inhibitions 
against the expression of these impulses may be 
1 The authors wish to acknowledge their gratitude to 
Dr. Ernest Haggard for his helpful criticism of this 


manuscript. 
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attributed to the functioning of the ego, the 
organized, executive subdivision of personality. 
This theory maintains that, as the child be- 
comes aware of the reality principle and as his 
ego begins to develop, he acquires a rudimen- 
tary ability to test reality, to tolerate or bind 
tensions and to postpone gratifications. These 
abilities, which are major aspects or compo- 
nents of ego-control capacity or ego strength, 
presumably enable the individual to control 
his impulses and to cope adequately with 
frustrations. 

With further development of ego-control 
capacity, there is increased recognition of, and 
adjustment to, culturally defined reality. So- 
cial drives become relatively more influential, 
compared with primary drives and immediate 
impulses, in guiding the child’s perception of 
the social environment and in directing his so- 
cial behavior. A child who acquires a high de- 
gree of ego-control capacity is likely to manifest 
more socialized behavior—behavior which con- 
forms with socially accepted standards—than 
a child whose ego-control capacity is less 
highly developed. 

Based on these theoretical considerations, 
the present investigation was designed to test 
the general hypothesis that individual differ- 
ences in ego-control capacity are related to 
degrees of inhibition of aggression and depend- 
ency behavior. 

It was first necessary to obtain measures of 
generalized ego-control capacity sufficiently 
sensitive and objective to provide reliable 
quantitative indices of individual differences 
in ego control in a group of normal children. 
Block and Martin (3) have presented two ex- 
perimental techniques that elicit behaviors 
which permit assessment of two important 
facets of ego-control capacity. They obtained 
measures of nursery school children’s abilities 
to delay gratification and to bind tension 1 
two procedures. These measures were relate 
to children’s tendencies to regress and tO 
attack the barrier in a replication of the Bar- 
ker, Dembo, and Lewin experiment (2). I” 
general, the results supported their hypotheses 
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that lack of ego control in children is associated 
both with a decrement in play constructiveness 
and with a tendency to attack the frustrating 
barrier directly. i 
_ The rationale underlying Block and Mar- 
tin’s use of the delay of gratification and co- 
satiation techniques stemmed from Lewinian 
conceptions, specifically those dealing with 
permeability of boundaries between need sys- 
tems. However, it is also possible to view both 
tasks as frustration procedures and, hence, to 
consider the individual’s performance as an 
index of his frustration tolerance—a basic as- 
pect of his ego-control capacity. The reasoning 
underlying this formulation seems immediately 
apparent for delay of gratification; for behavior 
in the cosatiation task, it draws upon Burton’s 
(4) finding that satiation may be psychologi- 
cally the equivalent of frustration. Whatever 
the rationale for the use of these techniques, if 
they yield indices that provide valid measures 
of general internalized ego control, they should 
relate to the child’s ability to inhibit the overt 
expression of socially disapproved drives in 
everyday life. 


Hypotheses 

We propose to test the following two specific 
hypotheses in this study: 

1. Among middle class children, there is a 
negative relationship between children’s ego- 
control capacities and the amount of overt 
aggression they express in the nursery school 
situation. 

2. In this group, there is also a negative re- 
lationship between ego-control capacity and 
degree of overt dependent behavior in nursery 


school children. 
METHOD 


Subjects 

Subjects (Ss) were 36 children, 19 boys and 17 girls, 
age from 38 to 59 months. Of this group, the 
ded the morning session, and the 
rnoon session, of the nursery 
school at the Institute of Child Welfare of the Uni- 
versity of California. The sample is a very select one: 
almost all of the children come from highly educated, 


upper-middle class families. 


Measures of Ego-Contro? 

The two techniques for measuring ego control, co- 
satiation and delay of gratification, were adopted with 
slight modifications from Block and Martin (3). 


ranging in 
three-year-olds atten 
four-year-olds the afte 


2 We are indebted to Dr. Frances Orr for lending the 
apparatus for these procedures. 


The first step in the cosatiation procedure was to 
show the S how a number of spools could be packed in 
a rectangular rack on top of a box and then be made to 
fall into the box by sliding the rack. After the spools 
had fallen, the rack could be refilled and the same pro- 
cedure repeated. The child was then told that he could 
play the game as long as he wished and he began oper- 
ating the sliding rack himself. As soon as he indicated 
that he did not want to perform the task anymore, i.e., 
when he was satiated, the experimenter (E) recorded 
the number of trials of spool packing. Æ then placed 
colored blocks into the rack, telling the child he could 
play with these, as he had with spools, as long as he 
wished. The number of trials for this second perfor- 
mance was recorded. 

The extent of deterioration in performance on the 
second task, following satiation on the prior one, may 
be expressed by an index of cosatiation very similar to 
the one used by Kounin (7). Performance on the first 
task serves as the basis of comparison: trials to satiation 
on the first task (spools) minus trials to satiation on the 
second (blocks) is divided by trials to satiation on the 
first task. This was the Cosatiation score. A high Co- 
satiation index is regarded as indicating relatively low 
ego control, i.e., satiation in the first task resulted in 
poorer performance in the second. 

The experiment on delay of gratification permits 
direct quantitative assessment of the child’s ability to 
delay immediate gratification of needs in order to attain 
greater rewards in the future. At the beginning of the 
experiment, Æ showed the child a small bow! containing 
a variety of small toys (commercially marked as brace- 
let “charms”). The S was then shown a bag full of 
small packages wrapped in wax paper and was told 
that each package contained a toy like those displayed 
in the bowl. It was explained that he would be given all 
the toys he accumulated. However, once stopped, he 


could not begin again. 
The apparatus and procedure for acquiring the toys 


was then demonstrated to the child. The apparatus was 
a toy known as a “coke loader” which consists of a 
crank-operated cart that could be moved up an incline 
until it tipped to empty the contents. The procedure 
was to put one toy at a time into the cart and allow S 
to operate the crank until the toy fell into a box. After 
insuring that S understood how the toy worked, Æ 
again told him that he could win as many toys as he 
liked and, at his request, could stop playing and be 
given all the toys in the box. If the child had not volun- 
tarily stopped by the time he had accumulated 20 toys, 
E ended the session by saying that time was up. 

It was assumed that the child was in a conflict situ- 
ation during this procedure; that he would want to 
claim his packages quickly to discover what they con- 
tained and to play with the toys, while at the same 
time, that he would want to get more toys. Presumably, 
the child who generally has a greater capacity to delay 
gratification would be able to control his desires to have 
his packages immediately and t 
tents. Hence he would be expected to accumulate a 
relatively large number of toys before deciding to stop. 
The delay of gratification (Delay) score was simply the 
number of packages accumulated in the box. 


o examine their con- 
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Measures of Overt Aggression and Dependency’ 


These were based on reports of observations of chil- 
dren’s nursery school behavior by the three teachers in 
each (morning and afternoon) session. Twelve children 
jn each session were selected for special observation for 
a period of two weeks or ten school days. This procedure 
was repeated for two more groups of 12 in a subsequent 
two-week session. Of the total of 48 children observed, 
36 were tested in both ego-control situations, and com- 
prise the sample of this study. : 5 

Two types of records were made: daily behavior 
reports for aggression and dependency, and weekly 
rankings of the 12 children being observed with respect 
to these two characteristics. The daily behavior report 
for aggression consisted of a checklist of 17 items of 
aggressive behavior. Included were: interfering with 
others’ activities; teasing or insulting; threatening; 
quarrelling, screaming or swearing; physically attack- 
ing; throwing objects; destroying something intention- 
ally; disobeying; temper tantrums. For dependency 
behavior there was a twelve-item checklist that in- 
cluded seeking physical contact, asking for help, seeking 
attention and/or approval, needing security object, 
wanting to go home, revealing hurt feelings, “helpless” 
crying, and asking permission unnecessarily. Each 

teacher filled out these two checklists for each child 
observed on that day. 

The incidence of aggressive and dependent behavior 
as indicated by the total number of items checked by 
one or more teachers on each day was tabulated for 
each child over the entire observation period. Unfor- 
tunately, as a result of children’s absences from school, 
not all Ss were observed for the same number of days; 
consequently, the number of daily behavior reports 
available was not the same for all Ss. In order to obtain 
comparable measures, the total incidence of aggressive 
and dependent behaviors for each S was divided by the 
number of days he was observed, yielding aggressive 
Acts/Day and dependent Acts/Day scores. These 
Scores were not computed for the three Ss who were 
absent from school for more than five of the ten obser- 
vation days. The remaining 33 Ss had aggression scores 
ranging from 0 (least) to 4.3 (most), with a median of 
+30; while the dependent Acts/Day scores ranged from 
0 to 3.4, with a median of .65. 

At the end of each week of observation, each of the 
three teachers independently ranked the 12 children she 
had been observing that week with respect to overall 
overt expression of aggression and dependency. Thus, 
there were six weekly rankings on each trait for each 
child which were summed to obtain his Pooled Ranks 
scores for each of the two behaviors. 

The interrater reliability for the Pooled Ranks 
scores was determined in the following way. For each 
trait, each teacher-observer’s average rank for the 
child, based on the two weekly rankings she assigned 
him, was treated as a score. The resulting scores of the 
morning and afternoon groups were then ranked 
separately. The rank-difference correlations between 


3 We wish to thank E. Peters and H. Tyau, head 
teachers, and E. Benjamin, E. Boydston, E. Elson, and 
H. Shimomaki, nursery school assistants, for their 
cooperation in this aspect of the study. 


each teacher in a session and the other two teachers in 
that session were computed. The median coefficient for 
aggression (based on six coefficients, three from each of 
the two sets of observers) was .80, for dependency, .48. 
In order to increase the low reliability of the dependence 
Pooled Ranks score, the analysis excluded those cases 
about which the observers showed the most disagree- 
ment. Specifically, if the average ranks on dependency 
assigned to S by any pair of observers differed by more 
than three units, no score was computed for him. This 
procedure reduced the number of Pooled Rank scores 
for dependence from 33 to 22. 


RESULTS AND Discussion 


In order to test the two major hypotheses, 
the measures of ego control were related to the 
indices of overt aggression and dependence. 
Since the distributions of scores on Delay and 
Cosatiation were non-normal and included a 
relatively large number of tied scores, the 
sample was dichotomized into High and Low 
ego-control groups on each of the measures, 
the dichotomization point in each case being 
selected so as to provide as nearly equal num- 
bers as possible in the two groups. (Since High 
ego control is indicated by a low score in Co- 
satiation and a high score in Delay, the terms 
High and Low will henceforth be used to refer 
only to ego-control status.) For Cosatiation 
(range, —.20 to .10; median, zero) the High 
ego-control group (V = 19) thus constituted 
included scores of zero and below; the Low 
group (V = 17), scores above zero. On the 
Delay measure, the 20 Ss who achieved the 
maximum score of 20 trials comprised the 
High group, while the Low group (V = 16) 
included the remaining Ss whose scores ranged 
from 5 to 18. 

Composite ego-control categories were de- 
vised in order to test the combined predictive 
ability of the two ego-control measures which 
tended to be positively, though not signifi- 
cantly, related (P = .20 by a U test comparing 
the actual Cosatiation scores of the High and 
Low Delay groups). The High Composite 
group (NV = 11) included those Ss who were 
classified as High ego control on both Cosatia- 
tion and Delay; the Low Composite group 
(N = 8) was composed of Ss assigned to the 
Low groups on both measures. 

In view of the dichotomization of the ego- 
control measures and the non-normality of all 
four of the behavior score total distributions, 
the U test (9) was employed to compare rank 
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transformations of all overt behavior scores of of these tests and their significance levels for 
the Highs and Lows on each measure of ego the experimental hypotheses are summarized 
control. (Figure 1 summarizes all these data in Table 1. 
ae shows each S’s ego-control status—High Compared with Ss high in ego control, as 
E from the two experimental measured by the Cosatiation score, those low 
a hes > sl goon and dependency in this measure of ego control scored signifi- 
a : or gend in upper right portion of cantly higher on both indices of overt aggres- 
g. 1 indicates ego-control status on both sion. There was also some tendency for those 
Cosatiation and Delay measures, zol actual who showed relatively high ability to delay 
scores, “H” and “L” in key refer to ego-control gratification to express fewer aggressive Acts/ 
level on the two measures. Orientation of the Day. For Pooled Ranks-Aggression, the differ- 
Acts/Day and Pooled Ranks scales are the ence between High and Low ego-control groups 
same; in both cases, the upper end of the scale on Delay was in the same direction, though 
denotes greater amount of the behavior. Refer- the difference was not significant. When the 
ence to the appropriate portions of this figure two ego-control measures were combined into 
will serve to illustrate the nature of the com- the Composite score, the less controlled Ss 
parison made by a given U test.) The results were found to be significantly more aggressive 
than the more highly controlled Ss. Thus, there 
is general support of the first hypothesis, that 


AGGRESSION DEPENDENCE ts ` <a 
Legend ego-control capacity 1S positively related to 


Sasser ability to withhold expression of aggression. 


40°44 
” biter aes Since the data from children of both sexes 
È 25-29 o]oe and of two age groups were combined in the 
are ae analyses, these relationships might be attribut- 
5 rona able to some extent to covariation of the meas- 
‘| ene ures of ego control and aggression with sex 
Vatar Sa TET ATATA TRENT ESETT and/or age. Thus, it is possible that girls de- 
a cy velop greater ego-control capacity earlier and 
Z n nae are also less likely to express aggression overtly. 
Era N-36 Similarly, both ego control and inhibition of 
2 cosas overt aggression may increase with age. In our 
È 60-69 sample, girls did show significantly fewer ag- 
wor gressive Acts/Day but were not significantly 
Fro. 1, DISTRIBUTIONS OF AGGRESSION AND DEPEND- different from boys on either of the ego-control 


ENCY SCORES FOR Hicn anp Low Eco-ContRoL ‘ 
Groups. Eac Unit IN HISTOGRAM REPRESENTS an S. Measures. None of the measures of aggression 
(See text for explanation of legend.) or ego control showed significant relationships 

TABLE 1 


COMPARISONS OF DISTRIBUTIONS OF AGGRESSION AND DEPENDENCE SCORES BETWEEN HIGH AND 
Low Eco-Controt GROUPS 


Ego-Control Measure 


Behavior Measure Cosatiation Delay Composite 
N* ia E iii N U P N | & 
Aggression Acts/Day 33 (17, 16)| 86.5) .04/33 (18, 15)|105.5} -14 16 (9, 7) |12.5| .02 
Pooled Ranks |36 (19, 17)| 82.5|<.01|36 (20, 16)|138.5) > .20)19 (11, 8)}17.0 01 
Dependence Acts/Day 33 tee 16)|127.0|> .20|33 (18, 15)|111.5| .20) 16 (9, 7) |24.0|>.20 
Pooled Ranks |22 (10, 12)| 54.5|> -2022 (10, 12)| 58.5> -20/12 (5, 7) |13.0/>.20 
partitioning of this total into the Ns for 


* First number is the total N in the comparison; numbers in parentheses refer to the 


the High and Low ego-control groups, respectively. 
+% All U values in this table are less than the value to be expected by the null hypothesis (one half 


Ns) and are in the predicted direction. 
e+% All P values are based on one-ta 


the product of the two subgroups 


iled tests as is required by the hypotheses. 
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with age. Hence, partialling out sex or age 
should not attenuate the reported relation- 
ships. 

The data did not support the second hypoth- 

esis concerning the relationship between ego 
control and the overt expression of dependency 
needs. Although all six comparisons between 
High and Low groups on the three measures of 
ego control yielded differences in the predicted 
direction, none were statistically significant. 
This failure to detect relationships between 
ego-control and dependency may be due to 
several factors. It is quite possible, in the first 
place, that dependency is inherently difficult 
to observe and measure under conditions of 
free observation in a nursery school situation. 
The original low correlations among the ob- 
servers on the dependency rankings attest to 
this difficulty. Another factor to be considered 
is that these children come from a highly select 
and homogeneous group of upper-middle class, 
highly educated families, a very large propor- 
tion of the parents having advanced academic 
or professional degrees. In such a group “‘get- 
ting along with others” is an important social 
value and is often considered a criterion of 
adequate social adjustment. Hence, socializa- 
tion of aggressive impulses, especially in inter- 
personal situations, is likely to be emphasized 
from early childhood. While children in this 
group may be encouraged to acquire independ- 
ence gradually, these parents are also likely to 
be permissive with respect to the expression of 
dependency for a relatively long period of 
time. Consequently, the ego-control capacity 
of the preschool child may have little bearing 
upon the amount of dependent behavior he 
manifests. 

Our data offer some indirect evidence to 
support the hypothesis that there is less em- 
phasis on the control of dependence than of 
aggression in this sample. Examination of the 
frequency distributions for the Acts/Day 
scores in Figure 1 shows that the incidence of 
both aggression and dependence generate J 
curves which Allport (1) has demonstrated to 
be typical for behavior in a number of con- 
formity-demanding situations. He has reasoned 
that the steeper the slope at the high con- 

formity end of a scale, the stronger are the 
conformity-producing pressures influencing the 
partic ular behavior. Dependence exhibits a 
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more gradual slope than does aggression, with 
the relative number of very high conformists 
being greater for aggressive Acts/Day. This 
difference may be quantified by a comparison 
of the skewness of the two distributions; the 
skewness indices (3 [Mean-Median]/Standard 
Deviation) are 2.78 for aggression and only 
1.06 for dependence. This difference in skew- 
ness is consistent with the hypothesis of differ- 
ential conformity pressures for the control of 
these two drives. 

The data indicating that ego-control capac- 
ity is positively related to the inhibition of 
aggression in interpersonal relationships are 
consistent with the finding of Block and Mar- 
tin (3) that children high in ego control make 
relatively few direct attacks on a frustrating 
barrier in an experimental situation. Thus, it is 
possible to interpret the findings of the present 
study—as Block and Martin viewed their re- 
sults—as generally supportive of a Lewinian 
theoretical formulation. 

It is also of interest to note that in both 
studies, performance in the cosatiation situa- 
tion proved to be a better predicter than did 
delay of gratification scores. We can only sug- 
gest that the Cosatiation score may have been 
a purer, less confounded index of ego control 
than Delay. It measured the child’s ability to 
continue to perform without extrinsic rewards 
even though he had just been satiated and was 
free to stop whenever he wished. Such behav- 
ior seems to correspond closely to our definition 
of internalized ego control. In contrast, since 
the delay situation provided extrinsic rewards, 
the final Delay score may have represented a 
confounding of internalized ego-control factors 
and a desire to obtain the available rewards. 
Thus, a High ego-control score on this task 
may have been due either to ability to delay 
gratification, to high motivation to possess 
many toys, or to both. 

The results of the present study extend the 
meaning and implications of the concept of 
ego control by demonstrating how it may be 
useful in understanding and predicting one 
aspect of socialization, the inhibition of aggres- 
sive expression. Furthermore, some of the find- 
ings suggest that the concept has predictive 
utility only for behavior to which socialization 
pressure has been applied. Thus the absence of 
a significant relationship between ego-control 
measures and overt dependency in these data 
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may be due to the fact that there is less em- 
phasis on dependency training in this group of 
Ss than we had initially assumed. For similar 
reasons, we would not expect to find relation- 
ship between ego control and the inhibition of 
overt aggression in preschool children from a 
social class group in which the expression of 
overt aggression is not discouraged at this age. 
Conversely, in groups which stress early train- 
ing in the suppression of overt dependency 
behavior, our original hypothesis of negative 
correlation between ego-control capacity and 
the incidence of dependency manifestations 
should be confirmed. 


SUMMARY 


This study tests two hypotheses concerning 
the relationships between ego-control capacity 
in children and their ability to inhibit the overt 
expression of aggression and dependency needs. 
Thirty-six nursery school children were scored 
on indices of aggressive and dependent behav- 
ior manifested during a two-week observation 
period. These scores were related to two meas- 
ures of ego control derived from performance 
in a delay of gratification and in a cosatiation 
situation. 

There is a significant negative relationship 
between amount of expressed aggression and 
ego-control as assessed by the cosatiation 
measure. The delay measure is also negatively 
related to aggression, though not significantly. 
A composite ego-control score, involving both 
measures, was also related to aggression. 

Dependent behavior is not predictable from 
any of the ego-control indices. The argument 
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is made that ego-control capacity should be 
expected to relate to the degree of expression 
of a drive only when that drive has been sub- 


jected to socialization pressures. 


REFERENCES 


1. ALLPORT, F. H. The J-curve hypothesis of con- 
forming behavior. J. soc. Psychol., 1934, 5, 141- 
183. 

2. Barker, R. G., DENBO, Tamara, & LEWN, K. 
Frustration and regression: an experiment with 
young children. Univer. Iowa Stud. Child Welf., 
1941, 18, Whole No. 1. 

3. BLOCK, JEANNE, & Martin, B. Prediction of be- 
havior of children under frustration. J. abnorm. 
soc. Psychol., 1955, 51, 281-285. 

4. Burton, A. The aggression of young children fol- 
lowing satiation. Amer. J. Orthopsychiat., 1942, 
12, 262-267. 

5. Cmn, I. L. Socialization. In G. Lindzey (Ed.), - 
Handbook of social psychology. Cambridge: 
Addison-Wesley, 1954. Pp. 665-692. 

6. Davis, A., & HAVIGHURST, R. J. Social class and 
color differences in child-rearing. Amer. sociol. 
Rev., 1946, 11, 698-710. 

7. Kovxiy, J. S. Experimental studies of rigidity I. 

Charact. Pers., 1941, 9, 251-272. 

Levin, H., & Sears, R. R. Identification with 
parents as a determinant of doll play aggression. 
Child Develpm., 1956, 27, 135-153. 

Mann, H. B., & WHITEY, D. R. On a test of 
whether one of two random variables is stochas- 
tically larger than the other. Ann. math. Statist., 
1947, 18, 50-60. 

Sears, R. R., Wartine, J. W. M., Nowus, V., & 
Sears, P. S. Some child-rearing antecedents of 
aggression and dependency in young children. 
Genet. psychol. Monogr., 1953, 47, 135-234. 

Warrmo, J. W. M., & Cann, I. L. Child training 
and personality. New Haven: Yale Univer. Press, 


1953. 
Received August 30, 1956. 


10. 


11. 


ATTITUDE CHANGE THROUGH REWARD OF VERBAL BEHAVIOR! 


WILLIAM ABBOTT SCOTT 
University of Colorado 


through anecdote if not personal ac- 

quaintance, with the woman who, when 
asked her opinion on social security, replied 
that she didn’t know because no one had ever 
asked her, and she never knew what her opin- 
jons were until she opened her mouth and ex- 
pressed them. Perhaps this apocryphal re- 
spondent was displaying more insight into the 
psychological processes underlying her attitude 
than is readily apparent. Though cognitive ap- 
proaches to attitude development are more 
flattering in their emphasis on the rationally 
cognized relation of an object to the person’s 
goals (1, 3, 8, 9), nonrational determiners may 
be equally important in the acquisition of 
attitudes. 

According to Doob (4) an attitude may be 
regarded, like a habit, as an implicit anticipa- 
tory response which mediates overt behaviors, 
and arises out of them through response rein- 
forcement. Thus, it is conceivable that an 
opinion be expressed initially, in the absence of 
a supporting attitude, but if the verbal behav- 
ior is rewarded, the corresponding attitude may 
develop and mediate subsequent opinion ex- 
pressions in the presence of similar cues. This 
response reinforcement formulation thus fo- 
cuses more on reward of verbal expressions 
than on cognized means-end relationships as 
critical factors in attitude development. 

The two formulations, and the sets of vari- 
ables they employ, are by no means incom- 
patible. It is reasonable to suggest, for instance, 
that lacking a well structured and salient set of 
cognitive associations regarding an object, a 
person may be more readily induced to respond 
to it in nonlogical fashion, the outcome of his 
response then contributing to a nascent cogni- 
tive structure; but given a set of values and 
cognitions that can readily be applied to a new 
object, these will influence the person’s attitu- 


Ps opinion researchers are familiar, 


1 The author is indebted to Prof. Maurice P. Smith 
for his suggestions regarding design and report of the 
study, and to the following graduate teaching fellows 
who assisted in the collection of data: H. Hess, P. 
Khanna, A. Mathews, and I. Richardson. 
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dinal set and thus produce a response consist- 
ent with the pre-existing cognitive structure. It 
is not the purpose of this paper to propose a 
systematically inclusive formulation of these 
two classes of determinants, but rather to offer 
evidence concerning the efficacy of response 
reinforcement in the determination of atti- 
tudes. 

Other studies (2, 5, 10) have demonstrated 
the influence of social approval and disapproval 
on the modification of verbal behavior, but 
they provide no information about their effects 
on the underlying attitudes. The present study 
was designed to explore, in a controlled fashion, 
the effects on attitudes of reward and punish- 
ment of expressed opinions. Specifically, it was 
predicted that subjects (Ss) rewarded by group 
approval for expressing opinions opposite from 
their initial attitudes would show a change in 
the direction of the expressed opinions, while 
Ss punished by group disapproval for expres- 
sing contrary opinions would not show such a 
change. This prediction implies that reward of 
a new behavior increases the relative strength 
of the underlying predisposition, while punish- 
ment of a new behavior leads the S to revert 
to his formerly preferred response disposition. 


METHOD 


All students in 29 general psychology discussion 
sections were administered free-response questions on 
mimeographed sheets to assess their attitudes toward 
three controversial issues: 

1. Universal military training. It has been pro- 
posed that all physically able males between the 
ages of 18 and 25 be required to spend two years in 
the armed forces so that they will be trained and 
ready for armed service in the event of a national 
emergency. How do you feel about this proposal? 

2. Night hours for women students. Senior women 
at the University are not required to be back in their 
dorms or houses by any set time at night. Some 
students are requesting that all women be given the 
same freedom of hours. How do you feel about this? 

3. De-emphasis of football. It has been suggested 
that too much emphasis is placed on football at the 
University, in scholarships and special treatment for 
players. How do you feel about this? 

Each of the classes was assigned, in systematic 
fashion, one of the three issues for debate. The instruc- 
tor read the class members’ opinions on the designated 
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issue and selected from each class two which appeared 
definitely favorable and two definitely unfavorable. 
The four students were contacted outside of class and 
their cooperation requested in an “experiment to sec 
how much they could affect the opinions of class mem- 
bers” by debating the particular issue; but each student 
would take the side opposite to his own opinion. Two 
pairs of debators were formed for each class, members 
with “pro” opinions taking the “con” side of the argu- 
ment, and vice versa. It was explained that, following 
the debate, the quality of performance would be as- 
sessed in two ways: first, by a class vote on which mem- 
ber of each pair did the better job; and second, by a 
retest of the class’ opinions on the issue, “to see in 
which direction opinions are influenced.” 

The debators were counseled to avoid mentioning to 
anyone that they didn’t really believe what they were to 
say. (This was intended to help prevent the occurrence 
of interfering responses (see 7) which might negate the 
effect of the experimental stimulus.) In pairing the 
debators, an attempt was made to equate their verbal 
abilities, as evidenced in their written opinions, in 
order that a subsequent false report of the vote would 


appear valid. 
The debates were held two weeks after the original 


assessment of attitudes. One pair of opponents was 
called into the classroom, while the other pair waited in 
an adjoining room. The “pro” debator spoke first, for 
three minutes, followed by a three-minute presentation 
from the “con” debator. Then each had a two-minute 
rebuttal period, in reverse order. The first pair left the 
room, and the second pair of opponents entered for a 
similar performance. Then, with all four debators pres- 
ent, voting instructions were given to the class (“On the 
first ballot write which side—pro or con—did a better 
job on the first debate. On the second ballot write which 
side—pro or con—did a better job on the second 


debate”). , 

The ballots were handed in, and the instructor made 
a pretense of counting them, while one of the students 
tallied votes (as read by the instructor) on the black- 
board, under the name of each debator. It had been 
determined in advance for each pair of debators in all 
classes who would “win.” “Winners” and “losers” were 
assigned alternately through the pairs of debators, so 
that half the time the “pros” won, and half the time 
the “cons” won. 

The instructor then repeated the implications of the 
vote, mentioning the names of the “winners” and 
“Josers,” and explained that the next step in the judg- 
ment would be to see how the class’s attitudes had been 
affected. All class members were given sheets of paper 
on which were printed the statement of the issue that 
had been debated—exactly like the first opinion assess- 
ment, except that only one issue appeared on the sheet. 
Debators were also asked to write their present opin- 
ions, “to see if there was any change in them.” 

After the opinion retest forms were collected, the 
nature and purpose of the experiment were explained, 
the pre- and postopinions of the debators were read, 
and an attempt made to reduce the threat of the situa- 
tion for the “losers” and for those debators whose 


opinions had ostensibly shifted. 
The pre- and postopinions of the debators were sub- 


sequently typed on cards, after the deletion of certain 
passages that might indicate which were postopinions 
(such as “I still feel that .. .”). The cards were identi- 
fied only by numbers, duplicated on the original 
opinion questionnaires, and the numbers assigned in 
“random” fashion, so as to prevent contamination in 
coding. Opinions on the cards were coded on a seven- 
point scale: 1—very pro; 2—pro; 3—pro, qualified; 
4—neutral or balanced pro and con; 5—con, qualified; 
6—con; 7—very con. Then, by reference to the original 
questionnaires, cach card was identified as a pre- or 
postopinion of a “winner” or “loser.” 

Of the 58 pairs of debators initially designated, only 
36 finally produced complete data. Twenty-one pairs 
failed to debate, due either to absences or to reschedul- 
ing of classes; the postopinion of one member of the final 
pair was not codeable. The 36 debates took place in 19 
different classes, with six different instructors. From 
these classes a sample of 36 students whose original 
opinions were definitely pro or con, but who did not 
debate, was selected as a control group. Their pre- and 
postopinions were similarly transferred to cards with 
random indentification by numbers, and coded on the 
same seven-point scale. 


RESULTS 


It was predicted that debators who “won” 
would show an average change in the direction 
of their debate greater than the corresponding 
change shown by “losers” and greater than 
the average change of the control group. Data 
bearing on this prediction are presented in 
Table 1. Winners changed an average of 1.25 
points (in the direction of debate) on the 
seven-point scale, while losers changed an 
average of 0.17 points in the opposite direction 
(more extreme than their original positions). 
The control group changed an average of 0.31 


TABLE 1 
CHANGES OF WINNERS, LOSERS, AND CONTROLS 
FROM PRE- TO POSTTEST 
(Positive sign indicates mean change in direction 
opposite to Ss’ original opinions) 


i Con- 
Winners| Losers | tols 


W= | V= | Ne 
36) 36) 36) 


Number of changers 
(either direction) 

Number of changers 
(toward opp. direction) 

Mean change 

SD of change 


24 16 18 


21 7 11 
+1.25|—0.17/+0.31 
1.77| 1.17] 1.37 


Difference in mean changes: 
Winners vs. losers: £ = 3.97; p < 001. 


Winners vs. controls: £ = 2.49; p < .02. 
Losers vs. controls: #= —1.58; p > -10. 
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points toward the opinion opposite from their 
original one. 

The difference between mean changes of 
winners and losers is significant beyond the 
-001 level of confidence, and the difference be- 
tween winners and controls is significant be- 
yond the .02 level of confidence. The difference 
between losers and controls is not significant. 

An analysis of mean changes for the three 

issues separately, and for original Pros versus 
original cons showed no significant differences 
among these groups, so the prediction was con- 
firmed regardless of the issue and regardless of 
which extreme opinion the debator started 
with. When the proportions of changers in the 
three groups are compared, results are con- 
sistent with those for mean changes. More 
winning individuals shifted toward the oppo- 
site opinion than either losers or controls. 


Discussion 


The experiment was designed to give both 
winners and losers equivalent contact with 


Companying 
implicit responses—attitudes and cognitive 
Support for them. The vote of “lose” presum- 


ably weakened whatever resp 


If the opinion posttests be regarded as valid 
reflections of Ss’ attitudes following the ex- 
perimental stimulus, then one may conclude 
that social reward for expressing a new opinion 
tends to reinforce the concomitant attitudes, 
whereas social disapproval of the new behavior 
tends to lead to nonreinforcement of the ac- 
companying attitudes. It appears that, in the 
present experiment, the brief experience of 
reward or punishment was a significant factor 
in accounting for attitude change, rather than 


the more prolonged experience of cognitive 
contact with opposing arguments, 

This finding is, however, subject to at least 
two interpretations. It may be that reward of 
a new behavior was the crucial factor, while 
Cognitive contact in and of itself had no effect. 

i with Opposing argu- 
ments produced a change in attitude, which 


type of situation. 


It should be noted that very few of the 
changes in opinion were of a spectacular sort. 
(Only seven of the rewarded Ss reversed the 
direction of their opinions; the other fourteen 
changers merely showed some weakening of 
their initia] opinions, as indicated by a shift 
along the S¢ven-point scale toward the neutral 
Position.) Although the experience of success or 
failure was made as strong as possible under 
these restricted Conditions, it was nevertheless 
a brief and perhaps not too potent one. Be- 
tween the announcement of the “class vote” 
and the posttest of opinion no more than five 
minutes elapsed. 

The degree to which the new opinions per- 
sisted was Not assessed in this study. It is 
Teasonable to suppose that effects of the ex- 
perimental situation were transitory, since the 

8’ initial attitudes presumably found consid- 
erable social support in their current friendship 
groups. In order to produce enduring attitude 
changes, it would probably be necessary to so 
change the Ss’ social environments that the 
new behaviors would receive continued rein- 
forcement. But transitory or enduring, the 
Principles of attitude change illustrated in 
this experiment would appear to be applicable. 
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SUMMARY 


An experiment in attitude change was sug- 
gested by Doob’s learning theory of attitude 
development. Seventy-two Ss were induced to 
engage in debates on three different issues, 
taking sides opposite to those which they had 
indicated as their own in an opinion pretest. 
Half of the Ss were rewarded, in predetermined 
order, by a purported vote which proclaimed 
them the better debators, while the other half 
were punished by presumably losing the de- 
bate. Posttests of Ss’ opinions showed a tend- 
ency of the “winners” to change their opinions 
in the direction of their debates, while the 
“losers” did not change significantly. A control 
group of nondebators likewise showed no sig- 
nificant change in opinions. 


REFERENCES 


1. Cartson, E. Attitude change through modification 
of attitude structure. J. abnorm. soc. Psychol., 


1956, 62, 256-261. 
2. Comen, B. D. ef al. Experimental manipulation of 


veel behavior. J. exp. Psychol., 1954, 47, 106- 
10. 

3. Crockett, W. H. Attitude change as a function of 
cognitive differentiation and affect under condi- 
tions of norm-presentation with and without 
counter-arguments. Unpublished doctor’s disser- 
tation, Univer. of Michigan, 1952. 

4. Doos, L. W. The behavior of attitudes. Psychol. 
Rev., 1947, 54, 135-156. 

5. Hiwprum, D. C., & Brown, R. W. Verbal rein- 
forcement and interviewer bias. J. abnorm. soc. 
Psychol., 1956, 53, 108-111. 

6. Janis, I. L., & Kine, B. T., The influence of role 
playing on opinion change. J. abnorm. soc. 
Psychol., 1954, 49, 211-218. 

7. Kerman, H. C. Attitude change as a function of 
response restriction. Hum. Rel., 1953, 6, 185-214. 

8. Krecu, D., & CRUTCHFIELD, R. S. Theory and 
problems of social psychology. New York.: Mc- 
Graw-Hill, 1948. Chaps. 5-6. 

9. RosENBERG, M. J. Cognitive structure and atti- 
tudinal affect. J. abnorm. soc. Psychol., 1956, 
53, 367-372. 

10. Verrranck, W. S. The control of the content of 
conversation: reinforcement of statements of 
opinion. J. abnorm. soc. Psychol., 1955, 51, 668- 


676. 
Received August 30, 1956. 


IDENTIFICATION AND OBSERVATIONAL LEARNING FROM FILMS! 
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focused upon the process of “passive” 
learning. We wish to explore the condi- 
tions which govern the nature and extent of a 
viewer’s learning the contents of a dramatic 
production which he views in a motion picture, 

Modern psychology has assimilated thor- 

oughly the lesson taught by Thorndike and 
Dewey, that we learn by doing rather than by 
having actions demonstrated to us. While 
there are a few reports in the literature of 
“observational learning” among animals (3, 4, 
11, 12) all the leading learning theories empha- 
size that the to-be-learned response must 
somehow be made to occur in the learning 
organism, not just in a model, before its as- 
sociation with a given set of conditions can be 
strengthened. Yet it is evident that viewers of 
movies and television dramas do learn much of 
the content of dramatic programs, at least to 
the extent of being able to recall subsequently 
many details of the characters’ actions, and 
they sometimes incorporate material thus 
learned into their own overt behavior. There 
are instances, for example, in which criminals 
have employed techniques for crime taken 
directly from movie portrayals (1). 

Presumably, a viewer’s learning of the 
actions of movie characters is enhanced if he 
reproduces these actions himself, at least 
covertly, during the screen portrayal. Our 
problem in predicting which viewers will learn 
how much of what content then becomes one 
of understanding the conditions that affect a 
viewer’s tendency to match his own actions, 
at least covertly, to those a screen character 
portrays. 

We assume that when a viewer becomes 
absorbed in a dramatic production, he identifies 
himself, at least momentarily, with one or 
more of the characters. By identification we 
mean that the viewer, in fantasy, puts himself 
in the place of a character and momentarily 
feels that what is happening to that character is 
happening to himself. In this process, we as- 


T the studies reported here, our interest is 


1 This research was supported by a grant from the 
Laboratory of Social Relations, Harvard University. 
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sume that although he may reproduce very 
little of the gross motor behavior of his screen 
character (10) he does reproduce covertly? 
many elements of the behavior including the 
emotions he attributes to the character, so that 
when the character with whom he identifies 
himself is in a dangerous situation for example, 
the viewer feels fear, and when the character 
escapes danger, the viewer feels vicarious 
relief. We assume, too, that when a viewer puts 
himself in the place of a character, he orients 
toward the cues to which his character is 
responding. There may be misperceptions, of 
course, in which the viewer misjudges the 
nature of the cues to which the actor is re- 
sponding, or misinterprets the actor’s response, 
in which case the viewer learns something that 
is not an accurate representation of what the 
movie producer intended to portray. 

While it is possible for a viewer to shift his 
identification from one character to another 
during a movie, most viewers probably make a 
choice of a character whom they consider to 
be the main protagonist of the story, and see 
the action mainly from his point of view. In 
many movies, the plot centers so fully upon one 
main character that all viewers may be ex- 
pected to identify with him. But in other plots, 
there are two or more main characters sympa- 
thetically presented, and this makes it possible 
for viewers to differ in their choice of an 
“identificand.” We assume that the nature of 
the content a viewer learns is in part deter- 
mined by the choice of identificand, for the 
viewer presumably focuses his attention upon 
the stimuli that are relevant for his own char- 
acter, and shares vicariously the experiences of 
that character rather than those of the people 
playing opposite him. Thus, if A assaults B on 
the screen, the viewers who are identified with 
A should feel anger and dominance, while 
those identified with B should feel fear, and 
these differential experiences at the time of 
viewing should be reflected in the nature of the 

? Covert practice has been shown to be fully as effec- 


tive as overt practice in the learning of certain kinds of 
material from films. See Michael and Maccoby (6). 
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movie content that is learned. Of course, the 
matter of the ultimate incorporation of learned 
material into the viewer’s own overt actions is 
a complex matter. This would depend not 
only upon the original learning of the material 
and perhaps its rehearsal in later spontaneous 
fantasy, but upon an appropriate situation 
arising in which the stored-up material can be 
evoked. Undoubtedly, much material as- 
similated from the mass media remains as an 
unused potential. A viewer may learn from a 
movie how to act as a groom at a wedding, but 
if he remains unmarried, the learning is never 
manifested in his behavior. Our concern in the 
remainder of this report is with the first step 
of learning from films; that is, with the sub- 
ject’s ability to recall film content verbally 
after a brief interval. 

We expect, as noted, some individual dif- 
ferences in the choice of a movie character with 
whom to identify. We also assume that 
identification is a matter of degree, and that 
viewers differ in the “depth” of their identifica- 
tion with their chosen character at any 
particular moment. The deeper the identifica- 
tion with a character, the more should the 
viewer reproduce covertly, and therefore learn, 
the actions and feelings of that character. 

In order to predict what a given viewer will 
learn, we must predict his choice of main 
protagonist, and the depth of his identification 
with that character. There are a number of 
presumably relevant factors. One undoubtedly 
is the consonance of the viewer’s motives with 
the character’s actions. If a screen character is 
doing what the viewer most wants to do, then, 
presumably, the viewer will be more likely to 
identify with him. 

Another factor, we hypothesize, is that of 
similarity between the viewer and the screen 
character with respect to certain major social 
characteristics, such as sex, age, and race. The 
importance of similarity has been pointed out 
in connection with projective tests such as the 
TAT. Murray (9) has recommended that TAT 
pictures include a character of age and sex 
similar to that of the subject, to facilitate his 
projecting himself into a character. We believe 
that a similar process is involved in identifying 
with a movie character. While the fantasy 
sequence in a movie, unlike the TAT, is one 
over which the viewer has no control, the 
viewer can vary in the depth of his involvement 


in the story—the amount of attention he pays 
to it—and we are assuming that his interest will 
be more deeply engaged if the story has a 
central character who is similar to the viewer 
in some important and discernible aspect of the 
life roles he occupies. 

There are instances, of course, in which a 
viewer’s similarity to a character facilitates 
identification but in which the viewer’s motives 
are more consonant with the actions of a 
different character. For example, the character 
in the story who is engaged in the activities the 
viewer would most like to perform may be of 
the opposite sex or a different social status. 
Under these conditions, we cannot make a 
prediction about the outcome. 

Another factor which should affect the 
viewer’s choice of a protagonist and the depth 
of his identification is the extent of the viewer's 
anxiety about the actions which the character 
portrays. Our expectation that a viewer should 
choose a protagonist similar to himself should 
hold if the actions being carried out are not 
only attractive but socially acceptable. But if 
the actions are simultaneously attractive and 
anxiety arousing, then the viewer would 
probably choose an identificand dissimilar to 
himself (14). The vicarious enjoyment of 
tabooed activity may be somewhat less if is 
enacted by a dissimilar person, but the anxiety 
it arouses would be reduced as well, and more 
sharply (cf. 7). We suspect that anxiety over 
the expression of aggression is one of the 
sources of children’s preference for animal 
heroes and space men. 

In the present studies, we have focused 
primarily upon the factor of similarity. That is, 
we have attempted to study whether viewers 
do in fact identify themselves with the movie 
character most like themselves, given a movie 
in which the viewer has some choice of pro- 
tagonist, and we have attempted to deter- 
mine whether differential identification pro- 
duces predictable differences in the nature of 
the movie content learned. We have also given 
some attention to the matter of “consonance” 
between the motives of the viewer and actions 
of particular characters. We have not studied 
the third factor—displacement of a preferred 
agent of action due to anxiety—but have 
instead attempted to minimize the problem by 
choosing movies in which actions of a sympa- 
thetic character that might be potentially 
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anxiety provoking are justified in the plot so 
that guilt or fear will be minimal. Thus, when 
aggressive actions are carried out by a hero, 
they are justified by the villainy of a “bad 
man” and occur in a context of social approval. 


Srupy I: Socrat CLASS 


In studying the effects on identification of 
similarity between the viewer and a movie 
character, the first variable with which we 
worked was that of social class. The movie em- 
ployed had two leading characters: a lower 
class boy and an upper-middle class boy. For 
this movie, our hypotheses were that: : 

1. Lower class children tend to identify with 
the lower class character and middle class 
children with the middle class character. 

2. Viewers who identify with a character 
remember more of that character’s words and 
actions than viewers who do not identify with 
that character. 

3. Since both leading characters are boys, 
the predictions about identification apply 
more to boy viewers than to girl viewers, 

Our hypotheses deal with the simple choice 
of a protagonist, not with the degree of involve- 
ment with that character once the choice has 
been made. However, consideration of degree 
would lead us to the parallel predictions that 
the more similar the viewer is to the movie 
character the more deeply he identifies with 
that character, and therefore the more his 
learning is specific to the content that is rele- 
vant to his chosen character, 

We expected that our hypotheses might not 
apply uniformly throughout the movie. Specif- 
ically, the differences between the two groups 
of viewers (those identifying with the two 
different main characters) might be maximized 
either when both characters were on the screen 
at the same time, or when only one of them 
appeared on the screen. There were r 
be discussed, for either prediction. W 
planned to use this study not only to test the 
specific hypotheses listed above, but to explore 
the group differences in selected scenes and for 
selected kinds of content. 


Method 


The Movie. Two 20-minute episodes of a seria] 
movie entitled Junior G-Men were used in the study; 
it was a story about adolescent spy chasers made about 
1940. Its two main characters were an upper-middle 


easons, to 
e therefore 


class boy and a lower class boy, both of whom were 
about 16 years of age. Harry Trent had his own car, 
lived in the best part of town, and could pilot an air- 
plane and operate a radio transmitter. He was the 
leader of a group of boys who called themselves the 
Junior G-Men, and who were well dressed, polite, and 
had an elaborate club house. Billy Barton was the 
leader of a gang of “dead-end kids.” They had an im- 
provised club house in a junk yard and were ill- 
dressed and aggressive. Both Billy and Harry were ad- 
mired fighters, but Harry won fights because he was a 
skillful boxer and Billy because he was a “slugger.” 

Subjects. Thé subjects (Ss) for this study were ten 
classes of seventh-grade students from public schools 
in the greater Boston area. The classes included 269 
students—139 boys and 130 girls. Two or more classes 
were taken from each of four junior high schools serv- 
ing communities that differed widely in socioeconomic 
level. 

Each child in the study was asked to record his 
father’s occupation at the end of his test blank, and 
the entries were coded on the Warner seven-point scale 
of occupational status (13). The sample included 94 
children whose fathers’ occupations fell at the first 3 
steps of the Warner scale, 42 at step 4, and 126 at 
steps 5-7, 

Procedure. The movie was shown to pairs of classes 
during a regular class period. The experimenter (E) 
told the children that they were about to see an ordi- 
nary entertainment film. The Æ said that he wanted to 
find out something about what kinds of things children 
notice and enjoy in movies, and that he would ask 
them a few questions about their reactions to the 
movie later on. The two reels shown were not consecu- 
tive, and while reels were being changed, the Æ de- 
scribed the intervening developments in the plot. When 
the two reels had been shown, the Æ said that he would 
come back in a week to ask some questions. 

One week later, a multiple-choice questionnaire on 
the movie was administered. The items included ques- 
tions on how well the § liked the movie, and several 
questions which were intended to reveal identification 
with particular characters, as well as test items on re- 
call of the content of the movie. The Ss were told to 
make the best guess they could when they were not 


sure of an answer, and very few children omitted items 
in filling out the questionnaire, 


Results 


The first question of interest is whether 
children from lower-class families more often 
identified with the lower class hero than did the 
children with middle-class backgrounds. As we 
defined the term, identification meant the 
viewer’s putting himself in the place of a given 
character as he experienced the story. Clearly; 
it is difficult to obtain a direct measure of this 
Process. There was an attempt made to get an 
indirect measure by asking several questions 
concerning the S’s reactions to the mai? 
characters. We reasoned that a child would not 
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be likely to “put himself in the place of” a 
character whom he did not like and admire, 
since to do so would be to share disapproved 
characteristics. We also felt that the process of 
a viewer’s imagining himself in a particular 
role is facilitated if the character playing the 
tole behaves as the viewer himself would 
behave in a similar situation. We therefore 
asked three questions for our indirect measure 
of identification: 

L Which one of the two main characters 
did you like best? 

2. Which of the two main characters would 
you like to be like? 

3. Which of the two main characters do you 
feel is most like you? 

The answers to these three items 
combined into an identification index. In the 
following discussion, a child is considered to be 
a “Billy” identifier if he chose Billy (the lower 
class boy) for two out of three of the above 
items. He is considered a “Harry” identifier if 
he chose Harry (the middle class boy) for two 


out of three of the items. 
As may be seen from Table 1, the upper- 


middle class boys tended to choose Harry as an 
identificand and the lower SES boys tended to 
choose Billy. Among girls, a majority chose 
Harry, and this was true for girls at each social 
class level. Each of the items in the identifica- 


tion index contributed to the findings in 
Table 1. That is, each was positively related to 
social class among boys; the question “which 
character is most like you” contributed most, 
but the index correlated more highly with 
social class than did any individual item taken 
alone. 

In Table 1, our measure of social class is a 
rating based on the occupation of the child’s 
father. We asked each child not only about his 
father’s occupation but about the kind of occu- 
he child himself would like to have when 
r girls this proved tobe a 
e a number wanted to be 
ho did mention the 
ave tended to 


were 


pation t 
he was grown up. Fo 
difficult question, sinc 
housewives, and those W 
kinds of jobs they would like to h 
make choices such as secretary, nurse, teacher, 


or air-line stewardess which provided little 


range in aspired social class. For the following 


3 The three items had low positive intercorrelations, 
with coefficients of contingency ranging from .26 to 


-40. 
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TABLE 1 


RELATION OF SOCIOECONOMIC STATUS TO CHOICE 
or MOVIE CHARACTER* 


Socioeconomic Status 
(Warner Occupation Scale) 


soe es 
Upper Middle Lower Middle & 
(1-3) Lower (4-7) 


Boys 
Chose Billy 23% 56% 
Chose Harry 75 40 
Not ascertained 2 4 
Total 100% 100% 
Number of cases (47) (89) 
xX=144 p= <.001 
Girls 
Chose Billy 30% 33% 
Chose Harry 64 ól 
Not ascertained 6 6 
Total 100% 100% 
Number of cases (47) 9 (79) 
=<. 


x= 1 


* Not included in this table are 7 cases whose SES was not 


ascertained. 


discussion of aspired social class, therefore, we 
have considered boys only. 

It may be seen from Table 2 that, regardless 
of the social-class level of a boy’s family, if he 
aspires to a relatively high level occupation, he 
tends to identify with Harry, the middle-class 
boy, in the movie. Boys who expect to have 
working-class occupations tend to identify 
with Billy, regardless of the occupation level of 
their fathers. Among boys with similar aspira- 
tions, there is only a little variation in choice 
of character that can be traced to current 
objective social-class membership. The con- 
clusion is quite clear that a boy’s choice of 
screen character is more closely related to the 
social-class level to which he aspires than to the 
level his family currently occupies. 

Identification and the recall of movie content. 
Our hypothesis was that if a viewer is identified 
with a particular character in a movie, this 
fact should determine what events on the 
screen he perceives and remembers. Specif- 
ically, we predicted that viewers who identified 
with Billy should remember more of the 
things Billy said and did, and more of the 
things that happened to Billy, than should 
the Harry identifiers, and that conversely the 
Harry identifiers should remember more of the 
Harry-relevant material. 

To test this prediction, we first designated 
the items in the multiple-choice test which 
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TABLE 2 


RELATIONSHIP BETWEEN OCCUPATION TO WHICH Boy 
ASPIRES AND His CHOICE OF MOVIE CHARACTER 


TABLE 3 


RELATIONSHIP OF CHorce or Movie CHARACTER 
TO RECALL OF Movie CONTENT 


Number Number 
Present and Aspired Social Class Ghons= | Grogs 
Billy | Harry 
Present SES High (1-3, Warner’s 
Occupation Scale) 
Aspired occupation high 7 30 
Aspired occupation lowt : 3 0 
Present SES Low (4-7, Warner’s 
Occupation Scale) _ 
Aspired occupation Hie be a 
ired occupation low 
Aspire ie ae yaa 


* Based on 117 boys for whom this information was available; 
22 Ss did not provide the information. : 
} Too few of the high SES Ss aspired to low SES occupations 


to allow computation of x 


could be classed as “Billy content” or “Harry 
content.” The following exemplifies a Harry- 
relevant item: “In the plans for the boys to go 
to the Torchies’ headquarters and try to 
capture the gang before they escaped, what 
was Harry supposed to do?” This question was 
followed by a list of possible answers. A Billy- 
relevant item was as follows: “What did Billy 
say when he turned around and faced the man 
who had followed him out on the docks?” In 
all, there were 27 items that could be classified 
as “Billy content,” and 20 items for “Harry 
content.” The test had 67 items, some of 
which were not classified into either of these 
content categories. Each child’s test form was 
scored separately for the number correct on 
each of these two sets of items. Then each of 
these scores was divided by the child’s “total 
correct” score for the entire test in order to 
eliminate the effects of reading ability, IQ, and 
other factors that might contribute to total 
score but were not relevant to our study. 
Table 3 shows that Billy identifiers re- 
membered slightly more Billy-relevant ma- 
terial than did the Harry identifiers. The 
differences, though significant for boys,‘ are 
not large. The comparisons on Harry-relevant 
content show that both boys and girls who 
identified with Harry remembered slightly 


4 When the data are analyzed separately by each ad- 
ministration of the film, each administration being 
considered a replication, and the results combined, the 
Billy identifiers among the girls also remember more of 
the Billy content than do the Harry identifiers (p 


< .05). 


Category of Content 
Billy | Harry| Billy | Billy 
ott | eee Aas Aa. 
tent | tent | siont |" Siont 
Boys 
Mean Score* 
Billy Identifiers 44.0 |28.0 | 19.4 | 24.8 
Mean Score 
Harry Identifiers 42.1 |28.7 | 18.0 | 24.3 
t 2.25] 1.00) 2.48 <5) 
pt -02) n.s. -O1) ns. 
Girls 
Mean Score 
Billy Identifiers 44.6 |28.0 | 18.3 | 25.7 
Mean Score 
Harry Identifiers 43.3 |28.2 | 17.2 | 26.3 
t 1.10; — 1.36} .67 
pt ns. | n.s. | ms. | n:s. 


* Each S’s score is the proportion of all items answered cor- 
rectly which were classified into the content category in question. 
Thus, among Billy identifiers, 44% of the correctly answered 
questions Were Billy content items. 

t The “Billy content” items were subdivided into aggression 
and nonaggression. 

$ Al the p’s are one-tailed unless otherwise stated. 


more Harry-relevant content, but the dif- 
ferences are not significant. 

A possible reason why these differences were 
so small is that the hypothesis tested does not 
take into account the attention a viewer de- 
votes to his character’s cues. We assumed that 
if a viewer is identified with a particular 
character he attends particularly to that 
character’s actions and remarks. But it is 
probably also true that he attends to the 
stimulus properties of his character’s environ- 
ment. If, for example, the screen character is 
being approached by a dangerous animal, the 
viewer probably looks at the animal rather 
than the story hero. We could predict, then, 
that the viewer might notice and remember 
the cue properties of the hero’s situation as 
well as (or better than) the actions and words of 
the hero himself. Thus, if the two main 
characters are on the screen simultaneously, 
and interacting with one another, each char- 
acter’s actions would be a cue for the other, and 
there should therefore be few differences in 
memory between the viewers who identify 
with each of the two characters. 

To investigate this possibility, we subdivided 
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it a acnorting to whether the op- 
with the a a Was Pee and mtieracung, 
is aracter asked about in the item. 
ten the Billy identifiers remembered 
Reg se — or the Billy content when 
ets l a the screen with Billy. They 
es Te slightly, but not significantly, 
eseni 4 m when the two boys were 
talkin A u illy’s action did not involve 
g to Harry. When Billy was talking to 
Harry, the Harry identifiers remembered the 
Billy content somewhat (not significantly) 
better,’ For boys, then, the results fit the inter- 
Eer that when both characters are pres- 
interacting, the nature of the remem- 
bered content is not a function of identification 
choice, since everything that happens is either 
the action of one’s own character or a cue to 
which he responds. 

We have shown so far that the different 
groups of viewers remembered much the same 
things from the movie but that the differences 
obtained favored the hypothesis that similarity 
of aspired-to class fosters identification with a 
given character, and that selecting one 
character for primary identification results in 
better learning of that character’s actions and 
words. A major problem exists, however, in 
determining whether the differential learning 
is actually a result of identification. There is an 
alternative explanation that Billy identifiers 
remembered somewhat more Billy activity 
because Billy engaged in the kind of activities 
that were most interesting to the kind of boy 
who would choose Billy as an identificand, and 
that it was the interest in the activities per se, 
rather than identification with Billy, that 

roduced the increment in learning. It is pos- 
sible that Billy identifiers would have re- 
membered more Harry material, if only the 
items about Harry had happened to con- 
centrate upon his aggressive, motoric actions 
instead of his more restrained, verbal ones. 

This alternative explanation is difficult to 
test, since we cannot guess what all the dif- 
ferent sources of attraction for Billy-identifiers 
might be, nor do we have test items bearing 
upon them all. Partial tests were nevertheless 
possible with respect to aggressive versus non- 
was a tendency for the Billy identi- 


ntage over Harry iden- 
ly’s talking to 


5 For girls, there 
fiers to show the greatest adva 
tifiers in the content which involved Bill 


Harry. 
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aggressive content. Starting with the pos- 
sibility that lower-class nonmobile children 
had a higher level of aggressive motivation 
than the other children (see 5) we considered 
whether the fact that Billy’s activities in the 
movie were much more aggressive than Harry’s 
could account for the greater learning of Billy 
content by Billy identifiers. We divided the 
test items into those dealing respectively with 
aggressive and with nonaggressive activity. 
Too few items on .Harry aggression were 
available to make a test for Harry content, but 
we could compare the recall of Billy-aggressive 
content with Billy-nonaggressive content. As 
Table 3 shows, the superiority of Billy identi- 
fiers in recalling Billy content applied to the 
aggressive items only. Parenthetically, it may 
be of interest that boys recalled the aggressive 
content better, and girls the nonaggressive 
content, both differences being significant at 
the .01 level. Thus we find that the Billy 
identifiers did not show an advantage in recall 
of all kinds of Billy content, but only those 
selected aspects that were presumably conso- 
nant with their own motives. This finding 
would not be predicted from our view of 
identification, for when a viewer puts himself 
in a character’s place, he should experience 
vicariously everything that character does and 
feels, regardless of whether each item of the 
character’s behavior touches upon one of the 
viewer’s major motives. 

From our existing data we cannot tell 
whether identification has contributed to our 
results, or whether we are simply observing the 
effects of selective perception based upon need. 
If both identification and need-consonance 
were required for maximal recall of movie 
material, we should find that the Billy iden- 
tifiers did not recall Harry aggression especially 
well, even though they are presumably set to 
select aggression, because the aggression is not 
carried out by their own preferred protagonist. 
In the absence of test data on Harry aggres- 
sion, we cannot answer the question. More 
evidence bearing upon this problem is available 


from Study II. 


Srupy II: SEX 


In this study, we are concerned with the 
differences between boys and girls in their 
identification with movie characters and their 
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memory of movie content. Continuing with 
our investigation of the hypothesis that 
viewers tend to identify with characters 
similar to themselves in major social categories, 
we sought a movie which would provide fairly 
comparable opportunities for identifying with 
either a male or female character. Our predic- 
tion was that viewers would choose to identify 
with the like-sexed character, and that the 
movie content they learned would be in part a 


ion of this choice. 
© specific hypotheses Study II was 


~~~ designed to test were as follows: 


1. Boy viewers tend to identify with the 
boy character and girl viewers with the girl 
character. 

2. Viewers who identify with a character 
remember more of the words and actions of 
that character than viewers identifying with 
the opposite character. This prediction applies 
only to words and actions of a given character 
that are mol directed toward the opposite 
character. 

3. Viewers who identify with a character 
remember more of the stimuli directed toward 
that character than those viewers who do not 
identify with that character. (Stimuli include 
words and actions of secondary characters 
directed toward the identified-with character, 
as well as other environmental cues to which he 
is responding.) 

4. There is no difference between the viewers 
who identify with the two primary characters 
in the amount of material they remember 
when the two characters are interacting with 
each other. 

5. Boys remember more of the aggressive 
content of the movie than do girls. 


Method 


The movie. The film was entitled Tomboy, a Holly- 

wood feature made about 1940. The two primary char- 
acters were a boy and a girl, aged 14 or 15, who were 
pupils in a rural school. The story concerned a new girl 
in the community and a boy living with a cruel uncle, 
-the developing interest of the boy and girl in each 
other, and the working out of the conflict between the 
boy and his uncle until the denouement with a happy 
ending. 

Subjects. The Ss in this study were 7th-grade pupils 
in three public schools in the Boston metropolitan area. 
The schools served quite varied communities: in one 
school, over half the fathers’ occupations were classified 
in Warner’s classifications 1, 2, and 3; in a second 
school, 24 of the fathers’ occupations were in Warner’s 


classifications 5, 6, and 7; in the third school, fathers’ 
occupations were rather evenly distributed over the 
various Classifications. 

Administration. In each school, the movie was shown 
in an auditorium to all the 7th grade pupils. The Æ 
told the children that they were about to sce an ordi- 
nary entertainment movie; he said that he wanted to 
find out something about what kinds of things children 
notice and enjoy in movies, and that he would ask 
them a few questions about their reactions to the movie 
later on. One week later the Es returned with question- 
naires. The pupils remained in their classrooms and 
were tested one classroom at a time. 

Measuring instruments. The measuring instruments 
were comprised of two parts: one designed to provide 
an indirect measure of identification; and the second 
designed to measure learning of the content of the 
movie, In addition, each S was asked to indicate his 
sex and his father’s occupation. 

As in Study I, we do not have a direct measure of 
the S’s “identification” with a movie character, but 
employed an indirect measure intended to reveal as- 
pects of each role’s attractiveness for fantasy role play- 
ing. Since we found in the first study that the question 
“which character did you like the best” was not as 
useful in predicting the recall of movie content as the 
other items in our identification index, we omitted this 
question, and used the following questions: (a) “Which 
character is most like you?”; (b) “Which character 
would you like to be in real life?”; (c) “Which part 
would you like to play in the movie?” The Ss were 
given five choices—the two primary characters and 
three secondary characters—and were asked to indicate 
their first two choices to each question. Their answers 
were combined into an index, described below. 

There are four kinds of content relevant to the hy- 
potheses guiding this study: (a) behavior, including 
actions, speech, and feelings, of one primary character 
not directed toward the other primary character (des- 
ignated “Alone” content); (b) behavior of one primary 
character directed toward the other primary character 
when they are in a scene together (designated ‘“Inter- 
action” content); (c) stimuli directed toward one of the 
primary characters but not by the other (designated 
“Stimulus” content); and (d) behavior of a primary 
character directed toward a secondary character in an- 
ger and/or with the purpose of injuring him either 
physically or psychologically (designated “Aggressive” 
content). 

Three different procedures were used, on different 
groups of Ss, to measure the learning of content: 

The first was a multiple-choice “recognition” test, 
similar to that used in Study I. This test was given to 
291 students. In Study I there were disappointingly 
small differences in recall scores between viewers iden- 
tifying with the two main characters. This fact sug- 
gested that our measuring instrument was relatively 
insensitive, and in Study II we therefore tried out tw 
additional kinds of measuring devices in an effort to 
obtain sharper differences. One was a “recall” test, in 
which the Ss (47 in number) were required to write in 
their own answers to “open” questions. Another was 2 
“reconstruction” test, given to 37 students, to be de- 
scribed more fully below. 
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Results and Discussion 


Identification. An “identification index” was 
computed on the basis of the answers to the 
three identification questions; this index indi- 
cated the degree to which one of the primary 
tian was seen as a more attractive role 
; a the other. In computing index scores, 

ach first choice was given a value of two, and 
each second choice a value of one. The index 
was equal to the score for the primary boy 
character minus the score for the primary 
girl character. Index scores showed very little 
Pross sex identification, as may be seen in 
> 4. We can therefore test our hypotheses 
ee of simple sex differences. All the 
hs wing comparisons are thus in terms of 
Oys versus girls. 
woe As is so often the case at this age 

ith verbal materials, the girls performed at a 
Somewhat higher level of skill on all three 
measures of learning of movie content. For 
example, on the recognition test the girls got 
an average of 39.5 items correct, as compared 
with an average of 38.0 for the boys; this 

ifference is significant at the .03 level. To 
allow for this difference and within-sex differ- 
ences in total score, all the raw scores for each 
category of content were converted into per- 
centage scores on the basis of the individual 
S’s total number correct. The results for the 
three different types of measuring instruments 
are considered separately below. 

Recognition. The recognition test was com- 
Posed of 64 multiple-choice items about the 
Content of the movie. There were four kinds of 
Content relevant to our study, and some items 
in each of these content categories concerned 

© male leading character, some the female. 

e items were selected in terms of (and 
Classified into) the following eight categories: 

» Boy Alone—boy character’s actions an 


TABLE 4 
Ibenrircation wira Boy Ann GirL Movie 
CHARACTERS BY SEX OF VIEWER 


Choose Boy Chome Gut 


Character 


No Choice 


“satifcation Index | 6| s| 4] 3| 2] 1) 9 j~ 

ore 

Number of Boys s| 3] 2) 1) 0) 0 
7| 13} 26| 23| 37| 53 


44]34|46]17|20]12] 13 
Number of Girls 


oj 1} af 1) 0] 6| 7 
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words when the girl was not present. 11 items. 
(Example: “How did Steve react when his 
Uncle Mat told him he couldn’t go to school 
anymore?”) 

2. Boy Interaction—boy character’s actions 
and words directed toward the girl. 7 items. 
(Example: “When Pat and her father came by 
to see Uncle Mat on Sunday afternoon, what 
did Steve say to Pat when he first got out to the 
car?”) 

3. Boy Stimulus—stimuli directed toward 
the boy character but not by the girl. 11 items. 
(Example: “Steve was talking to Pat and her 
father on Sunday morning and suddenly said, 
‘Gee, it’s late. I’ve got to run.’ What reminded 
him he had to go?”) 

4. Boy Aggression. 7 items. (Example: 
“How many times did Steve hit Harry?”) 

5. Girl Alone. 7 items. (Example: “When 
Pat was getting lunch ready on the picnic, how 
did she show that she missed Steve?”) 

6. Girl Interaction. 7 items. (Example: 


“When Pat and her father stopped by at 
hat did Pat 


Steve’s house after the picnic, W. 


say to Steve?”) y 
7. Girl Stimulus. 7 items. (Example: “When 


Pat went out the back door on Sunday morning 


looking for her father, what did she see?”) 
8. Girl Aggression. 7 items. (Example: 
«What did Pat do while the tramp was holding 


her?”) 
We predicted that boys would remember 


more aggressive content from the film than 
girls. This proved to be true. (See Table 5-A.) 
Furthermore, girls remembered more Girl 
Alone content, which was predicted, and more 
Girl Interaction content, which was nol pre- 
dicted. There were no significant differences on 
the other categories of content, although all 


were in the predicted direction. 
Recall. The instrument used in the recall 


testing was composed of items that asked a 
question about the content of the movie and 
provided a space for the S’s response. The 
items were selected from ones used in the 
recognition procedure, the stems being slightly 
reworded. There were five items from each of 
the eight categories of content. In scoring the. 
recall questionnaire, two points were given for 
each essentially correct item, one point given 
for a partially correct item, and 0 points for 
each incorrect item. Points were added to 
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TABLE 5 
Sex DIFFERENCES IN MEMORY OF MOVIE CONTENT 


Content Categories 
Boy Boy Girl Girl Girl Girl 
Alone [sumalus |Asgres-| Imer | Gine | gill | anges: | Inter 
A: Recognition Test* 35 3 
d ent score (N = 153) 18.0 16.6 8.6 107 iS 13.7 DT (iL. 
‘en rate Ber Gat score (V = 138) 17.9 | 16.4 | 7.4 | 10.7 12.3 | 14.2 9.5 112.1 
33 -53 | 3.50 -17 | 2.28 | 1.44 -56 | 2.12 
ot n.s. n.s. .001| n.s. .01 30 | ns. .05} 
B: Recall Test* 
’ t score (N = 27) 16.00 | 10.1 |19.3 6.9 112.7 12.7 10.6 |13.2 
con rach Be con uscare iy = 20) 14.50 | 8.2 |13.6 | 8.1 |10:9 | 15.1 |102 |195 
t -78 1.36 | 3.98 .81 | 1.11 1.19 2 | -3.15 
ti n.s. .10 -001| n.s. n.s. .12 | n.s. .005¢ 
: tion Test: Unit Classification* 
ag Ber cent score (V = 20) 31.4 | 19.6 | = | 12.7 hoo | 8.5 = 14.2 
Mean Girls’ per cent score (V = 17) 26.2 | 17.5 = |} 42:3 BSI | 10.5 = 1.3.5 
t 2.32) 1.17 -20 | 3.38 | 1.70 64 
ot <.02 | <.13 ns. |<.001) <.05 n.s. 


* Each S’s score is the number of items correct in a content category divided by the total number correct on the entire test. 


t All p’s are one-tailed tests unless otherwise indicated. 
t P is two-tailed: no direction was predicted. 


provide a point score for each category of 
content and a total point score. 

On the recall test, boys remembered more 
Boy Aggression content than did girls, and 
girls remembered more Girl Interaction con- 
tent (see Table 5-B). Differences here are much 
larger than for the previously discussed 
recognition test. The differences obtained for 
Boy Aggression and Girl Interaction scores 
were at the same or higher level of statistical 
significance even though the test was shorter 
and only about 14 the number of Ss used. 

Reconstruction. The reconstruction testing 
procedure involved the use of still photographic 
prints of scenes from the movie. The Ss were 
to respond to each stimulus picture by writing 
a paragraph or two telling what was happening 
at the moment of the picture, what had hap- 
pened just before the picture, and what 
happened just after the picture. They were 
individually given nine 3 x 444 inch photo- 
graphic prints, with four minutes to write 
about each. Three pictures depicted the 
primary boy character alone or with secondary 
characters; three pictures depicted the primary 
girl character alone or with secondary char- 
acters, and three pictures depicted the two 
primary characters, boy and girl, together. 

6 Unfortunately, in printing, the pictures were re- 


versed from right to left. The Ss did not report being 
confused about the scenes on this account, however. 


The scoring of the resulting protocols was 
in terms of a “correct and relevant subject- 
predicate unit”; that is, each subject-predicate 
unit that actually occurred in the movie in the 
specific scene from which the picture was taken 
was counted. In addition, a score of one unit 
was given for the mention of material—no 
matter what the quantity in terms of subject- 
predicate units—that did not occur in the 
specific scene in question but was relevant to it 
either as a direct antecedent of the scene or a 
direct consequence of it. Material that actually 
occurred in the movie but was not directly 
relevant to the scene in question was not 
counted, nor was material that did not occur in 
the movie. 

Each unit was classified if possible in one of 
the eight previously defined relevant categories. 
Less than two per cent of the content of the 
Protocols could not be so placed. Aggressive 
material made up such a small proportion of 
the content of these protocols (about 3%) that 
it is not considered in the comparisons. 

Half of the protocols were scored by two 
independent scorers. Interscorer relia’ es 
were computed by means of rank-order cor“ 
relations. The reliability of the total scores was 
:90, while the reliability of the scores on the 
nine individual pictures ranged from .81 to .95 
with a median of .86, and that of the scores for 
the six subcategories of content ranged from 
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65 to .94 with a median of .80. Score-rescore 
1 reliabilities of a single scorer were somewhat 
) higher. 

fi The results obtained through this procedure 

y may be analyzed in two ways: first, in terms 

- of the individual unit classification, and 
secondly, in terms of the picture classification. 
In the first instance, boys and girls are com- 
pared on the basis of the score for each category 
. of scorable units (Boy Alone, etc.), without 
regard for the classification of the picture for 
which the units were written. In the second 
instance, boys and girls are compared on the 
basis of the scores on each type of picture: Boy 
Alone, Girl Alone, and Interaction, without 
regard for the classification of each particular 
unit scored for that picture. 

g An the first analysis, boys obtained sig- 

nificantly higher scores on Boy Alone content; 
girls obtained significantly higher scores on 
Girl Alone and Girl Stimulus content; and 
there were no significant differences between 
boys and girls on Boy Stimulus, Boy Interac- 
tion, and Girl Interaction content (see Table 
© 5-C). It should be noted that the differences 
i “obtained with the use of the reconstruction test 
are larger than those obtained on the recogni- 
tion test. 

In the second analysis, boys obtained 
statistically significant higher scores on the 
Boy Atone: pictures; girls obtained signif- 
icantly higher scores on the Girl Alone pictures; 
and there was no significant difference between 
the boys and girls on the Interactive, pictures. 
(See Table 6.) : 

Combined results. Since the content categories 
for the three testing procedures were defined in 
essentially similar ways and the three pro- 
cedures were used on independent sets of 
Ss, it is permissible to combine the probabilities 
obtained on the three procedures by the Stouf- 
~ fer technique (8). The results are shown in 
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TABLE 6 


Sex DIFFERENCES IN MEMORY FOR MOVIE CONTENT 
RECONSTRUCTION TEST: PICTURE CLASSIFICATION 


Classification of Picture 
Boy Girl | Inter- 
Alone | Alone | active 
Mean Boy per cent score | 35.3 | 27.0 38.3 
(N = 20) 
Mean Girl per cent score | 30.4 | 33.9 35.6 
(N = 17) 
t 2.43 3.52 1.21 
p* <.01 | <.001 | <.25 


* P’s are one-tailed tests except for the “Interactive” which 
is two-tailed. 


Some of our original hypotheses are sup- 
ported; others are not. We predicted that boys ` 
would remember more of the Boy Alone and 
Boy Stimulus content, and this proved to be 
the case. We also predicted that boys would 
remember more aggressive content, but we 
found that they remembered the aggressive 
content better than the girls only if the boy 
hero was the agent of aggression. As far as the 
girls are concerned, they remembered more 
Girl Alone and Girl Stimulus material, as 
predicted, but their greater memory for “girl” 
content did not extend to aggression, even 
when the girl was the agent of aggression. 
Furthermore, we predicted that there would 
be no difference between the sexes in their 
recollection of content in which the boy and 
girl were interacting, on the grounds that any _ 
action on the screen would constitute either, 
an action of one’s own character or a relevant 
cue for his behavior. We found, however, that 
girls did remember, better than boys, the 
words and actions of the girl character when 
they occurred in a context of interaction with 
the boy. 

These findings suggest certain revisions of 
our hypotheses, still further in the direction 


Table 7. 
J TABLE 7 
Í COMBINED PROBABILITIES OF SEX DIFFERENCES OBTAINED IN THREE PRoceDURES OF Stupy II 
Content Categories 
| Boy Boy Boy ; Girl Girl Girl 
Bı 4 Girl =. 7 č 
Aigke | timer] Aggres- | Inter | Aione | SHR | fon | action 
Recognition Test Z (N = 47) 0.32 | 0.52 | 3.50 0.16 2.27 | 1.43 | 0.56 2.12 
\ Recall Test Z (N = 47) 0.77 | 1.34 | 3.68 —0.80 | —1.09 | 1.17 | 0.25 2.90 
| Reconstruction Test Z (V = 37) 2397 | LIS | = 0.20 3.08 | 1.65] — | —0.63 
Combined Z 1.91 | 1.74 | 5.10 —0.25 2.46 | 2.45 | 0.57 2.54 
<.03 |<.05 |< .001 <.50| <.01 |<.01 |<.30} <.01 


Combined  (one-tail) 
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indicated in Study I. Memory for movie 
content appears to be influenced both by 
identification with a character in the movie 
and by the “need relevance” of a particular 
kind of content for the viewer. In cases where 
both identification and relevance are acting 
in the same direction, one should expect the 
greatest differences between boys and girls; 
where identification and relevance are acting 
in different directions, one should expect a 
minimum of difference between boys and girls; 
and where only identification is affecting the 
situation, the differences should be inter- 
mediate. If we assume that aggressive material 
is particularly relevant for boys, .nd boy-girl 
interactive material for girls.’ the differences 
would be almost exactly of the order we find 
in Table 7, with Boy Aggression and Girl 
Interaction showing substantial differences, 
and Girl Aggression and Boy Interaction 
showing little or no differences. 


SUMMARY 


Our hypothesis was that the amount and 
kind of material a viewer learns from a movie 
is in part a function of his choice of a character 
in the movie with whom to identify himself. 
We expected that a viewer would tend to 
identify with a character similar to himself in 
major role characteristics (e.g., sex, age, Or 
status) whenever the movie provided more 
than one sympathetically presented character. 
And we reasoned that the viewer, having 
chosen a character as the primary protagonist 
through whose eyes to experience the action 
of the movie, would remember more of that 
character’s actions and words, and more of the 
stimuli to which that character was responding, 
than would the viewer who was identified with 
a different character. 

In two studies we showed movies to 25 
class rooms of seventh-grade children, and 
tested them a week later on their knowledge of 
movie content. 3 

Using an indirect measure of identification, 


7 Our assumption about aggressive material being 
more relevant for boys, and romantic material for 
girls, is consonant with the findings of Dysinger & 
Ruckmik (2). These workers took PGR readings of 
children watching movies, and found a higher level of 
emotional response among boys to danger and conflict 
scenes, and among girls, to love and erotic scenes. 


we found that viewers did indeed identify 
themselves with the like-sexed leading char- 
acter, in viewing a movie which included both a 
strong male and strong female lead. With 
respect to similarity of social class, however, 
viewers were more likely to choose the pro- 
tagonist whose social class corresponded with 
the viewer’s aspired social class, rather than 
his current objective status. 

As far as memory for movie content was 


concerned, we found that both the choice of 


a character with whom to identify and the 
“need relevance” of that character’s actions 
were important. That is, viewers do tend to 
remember somewhat better the actions and 
words of the character with whom they 
identify, and the stimuli relevant for that 
character’s actions, but the advantage in 
learning does not apply equally to all the 
actions of the viewer’s chosen character. For 
boy viewers, aggressive content appears to be 
particularly relevant, for they remember 
aggressive content better than girls, provided 
its agent is the boy hero. For girls, boy-girl 
interactive content appears most relevant, and 
they remember this content better than boys 
whenever the girl heroine is the agent of the 
action. f 
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among patients afflicted with physically 
crippling illnesses or injuries, the psycho- 
logical reaction to the trauma seems to bear 
little relationship to the extent or degree of 
the disability. Some patients with minimal 
physical incapacitation demonstrate a catas- 
trophic reaction, a feeling of complete devasta- 
tion, while others with extensive physical 
involvement are able to accept the prospects 
of permanent severe crippling and adjust to 
their handicap without evidence of marked 
personality disruption. Perhaps this difference 
in psychological reaction to a change in body 
structure may be accounted for, at least in 
part, by differences in the way individuals con- 
ceive of their own bodies. The concept of body 
image as a system of ideas and feelings which 
the individual has about his physical structure 
has had frequent application in the psychologi- 
cal literature on poliomyelitis and other dis- 
abling conditions. It has been pointed out (1, 5, 
6, 9, 10) that disablement imposes upon the 
individual the necessity for a more or less 
drastic modification of his body image, an 
abrupt change in his perception of himself as a 
physical being. Difficulties in adjustment 
are often conceptualized as difficulties in incor- 
porating the atrophied body, the facial dis- 
figurement, or paralyzed limb into the basic, 
pre-existing body image. The ease or difficulty 
with which this integration takes Place is an 
index of the success which one has with one 
aspect of the problem of readjustment. 
Although body image is a concept often used 
in discussions of the psychological implications 
of physical handicap, there have been few at- 
tempts to derive systematic measurements 
relating to this phenomenon. Fisher and Cleve- 
land (2, 3, 4), however, have described a 
method of obtaining measurements on one 


i has been a general observation that 


1 Southwestern Poliomyelitis Respiratory Center, is 
aided by an annual grant from the National Foundation 
for Infantile Paralysis, Inc. 
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dimension of body image. This dimension, j 
called the body-image boundary, has to do with, 

the degree to which the individual conceives Ss 
of his body boundaries as definite and firm 
versus indefinite and weak. Its measurement 
is expressed in terms of what is called the 
Barrier score or the total number of a particular 
type of responses (called Barrier responses) } 
produced by the individual on the Rorschach 4 
test. In addition to their published findings on l 
normals and various psychosomatic symptom 
groups, Fisher and Cleveland have determined 


in unreported exploratory studies that degree | 
of boundary definiteness is significantly linked 4 


with ability to tolerate stress in normal 
subjects and with facility of readjustment to 
body mutilation in individuals who have 
suffered limb amputation. 4 

The concept of body-image boundary shoul: č 
have especially marked implications ir & l 
disease such as poliomyelitis in view of the ract 
that probably no other disease carries with it 
the actuality or threat of more severe physical i 
incapacitation. It is conceivable that successful 
adaptation to such a disability may be posi- 
tively related to the strength or firmness of the 
body-image boundary. The individual who 
already has a body image with a firm, definite, 
and protective boundary should be better able 
to withstand the threat incurred from physical 
insult, to incorporate the damage, and to re- 
establish the body-image boundary. 

Within this framework, a study was designed 
to test the hypothesis that patients who 
achieve the better psychological adjustment 
to the physical disability imposed by polio- 
myelitis show, through a greater number of | 


sdi 


Barrier responses, 
more definite body- 
tients with a less 


dividual’s physical 


tion of his illness at the time of testing, is 2 


BOoDY-IMAGE BOUNDARIES AND ADJUSTMENT To POLIOMYELITIS 


function of a more pervasive body-image 
factor in the personality which may dispose 
the individual toward a successful or unsuc- 
cessful adaptation to physical handicap. 


METHOD 


The subjects (Ss) were 56 patients hospitalized in a 
poliomyelitis respiratory center which cares for patients 
in both the acute and postacute stages of illness. The 
group included all types of poliomyelitis cases: spinal, 
bulbar, and bulbo-spinal, both with and without re- 
Splratory muscle paralysis. It included cases with 
widely varying physical involvement, ranging from rela- 
tively mild to severe with a predominance of severe 
cases. It was regarded as representative of the usual 
patient population of the respiratory center. The group 
included 29 males and 27 females. The age range was 
from 15 to 43 years with a median of 26; education 
varied from 3 to 20 years with a median of 13; and 
Wechsler Verbal IQs (available on all but 4 patients) 
ranged from 72 to 145 with a median value of 114.2 

In order to obtain the sample, files were examined 
on all patients, both current and discharged. Sixty Ss 
were found whose Rorschach protocols contained the 
minimum number of responses necessary for scoring the 
body-image variable. These patients were then evalu- 
ated in terms of adjustment to illness by procedures 
which will be described later. In the process of this 
evaluation, four patients were eliminated because of 
disagreement between judges as to adequacy of adjust- 
ment, leaving a total of 56 Ss. 

The Rorschach test had been administered individ- 
ually to each of these patients for general clinical 
assessment purposes prior to the inception of this 
study. In all cases the test had been given by an 
examiner who at the time had no knowledge of the 
method of determining body-image values from the 
Rorschach. Since the test had been given more or less 
routinely to adult patients, there was no indication of a 


selective factor in the administration. 
Twenty-eight of the 56 Ss had been tested by the 


senior author and at the time of the study were either 
still in the hospital or had only recently been dis- 
charged. The remaining 28 Ss had been tested and 
subsequently discharged prior to the senior writer’s 
employment at the respiratory center. These two 
groups, which were treated separately as independent 
samples, are referred to as Groups I and II, respec- 
tively, Adjustment ratings were made in somewhat 
different fashion for the two groups and involved differ- 
ent sets of judges. It was of interest to determine 
whether a relationship between body-image data and 


? This high median intelligence level should not be 
taken as an indication of selective sampling. The 
Wechsler-Bellevue Verbal Scale has been given to more 
than a hundred adult patients at the respiratory center 
by various examiners during the past three years, and 
the median IQ for these data corresponds very closely 
With the median IQ of 114 reported for the Ss used in 
this study. Apparently adult poliomyelitis patients, at 
least those treated in this center, are as a group above 
average in intelligence. 
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adaptation to disability occurred in both samples. If 
so, it would suggest a degree of stability in the rela- 
tionship. 

Groups I and II were each subdivided into two 
categories relating to the adequacy of adaptation to 
illness. The method used was that of gross clinical 
evaluation. In the case of patients in Group I, three 
stafi physicians who knew the patients well and a 
psychologist (the present senior author), made inde- 
pendent gross evaluations of the 32 Ss on whom Ror- 
schach records of sufficient length were available. These 
evaluations were made prior to inspection of the Ror- 
schach protocols with reference to the body-image 
variable. Judges were asked to divide this group of 
patients into two categories of approximately equal 
number, placing those who had shown the most satis- 
factory adaptation to their illness in one group, and 
those with the least satisfactory adaptation in another. 
No attempt was made to define for the judges the 
criteria for satisfactory or unsatisfactory adjustment. 
They were simply told to use their global clinical im- 
pression of the patient’s behavior during his period of 
hospitalization in allocating him to one or the other of 
the two categories. In 28 out of the 32 cases, at least 
three out of the four judges agreed as to placement of 
the patient. These 28 individuals composed Group I. 
The “satisfactory” and “unsatisfactory” categories thus 
identified included 15 and 13 patients, respectively. 

Because the 28 patients in Group II had been dis- 
charged for some time, it was felt that staff members 
would not remember them well enough to make an 
evaluation of their adjustment while in the hospital. 
However, an index of adjustment was already available 
in the records of these patients. Soon after the patient’s 
discharge, the social worker at the Center had rated on 
a 5-point scale the adequacy of the patient’s overall 
psychological adjustment to his illness during the 
course of his hospitalization. This evaluation in a sense 
represented the pooled judgments of the entire staff. It 
was essentially an integration by the social worker of 
her own impressions and the opinions expressed by 
various members of the staff during a number of con- 
ferences on the patient and in the course of informal 
contacts between the social worker and other staff 
members while the patient was in the respiratory cen- 
ter. This entry, known as “Patient’s Response to Dis- 
ease (PRD),” is one dimension of the “Social Service 
Profile,”’® which has been in use at the Center for a 
number of years. It is completed on each patient by 
the social worker as a routine procedure. For purposes 
of illustration, the definitions of the two extremes of 
the PRD scale are shown as they appear in the Profile: 

PRD O. Degree of anxiety and apprehension 
present does not seem out of proportion to the 
neurological involvement; cooperates to the extent 
of his ability. His responses to illness and hospitali- 
zation do not interfere with his acceptance of treat- 
ment. Can utilize his residual abilities rather than 
withdrawing and becoming completely dependent on 


3The “Social Service Profile’ was designed by 
Maurine B. Mitchell, Director of Medical Social Work, 
Southwestern Poliomyelitis Respiratory Center and 
Wolff Home Poliomyelitis Rehabilitation Unit. 


others; is considerate of the rights and privileges of 

others (within his physical limitations). 

PRD 4. Physiologically unjustified amount of 
extreme anxiety; insecurity and apprehension pre- 
vent cooperation; may result in patient becoming 
either fearful, belligerent, hostile, or rebellious; fails 
in making acceptable adjustment to hospitalization 
and separation from family; complete obstruction of 
treatment program. 

For the ees in Group II, dichotomizing the 
distribution of PRD scores into categories of 0-1 and 
2-4 yielded two groups of 14 patients each, which were 
designated as “satisfactory” and “unsatisfactory” ad- 
j mt groups. 
reece en tte on the total sample and on the four 
subgroups are presented in Table 1. N 

The method used for scoring body-image boundary 
definiteness from the Rorschach has been described in 
detail elsewhere (2). The basic assumption underlying 
the scoring procedure is that the definiteness of the in- 
dividual’s body-image boundaries is reflected in the 
manner in which he describes the peripheries of his ink 
blot percepts. Definite boundaries are linked with de- 
scriptions which emphasize the protective, decorative, 
covering, containing, and concealing attributes of per- 
cept peripheries. The following are examples of Ror- 
schach responses which exemplify the attributing of 
such special qualities to the periphery: “cave with 
rocky wall,” “mummy all wrapped up,” “man in suit 

of armor,” “fort with high walls,” “woman with fancy 
decorative dress.” The number of responses of this sort 
given by an individual in his Rorschach record is taken 
as an index or score of boundary definiteness and is re- 
ferred to as the Barrier score. It has been shown that 
the Barrier score can be determined by scorers with a 
high degree of objectivity (2). 

Because the Barrier score is probably correlated 
with the total number of Rorschach responses pro- 
duced, it was necessary to control variations in response 
total as much as possible. This was done by using only 
records with at least 15 responses and by reducing to 
25 responses all records with more than 25. In the case 
of records with R greater than 25, this procedure in- 
volved using only the first three responses on each of 
the first five Rorschach cards and the first two re- 
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sponses on each of the last five cards. When there was 
less than the set quota of responses on any given card, 
it was compensated for by taking an extra response on 


the next card in the sequence for which there was more 
than the set quota. 


After the records had been scored for Barrier re- 
sponses, using this method of controlling R, the group 
of 56 Ss was dichotomized at the median of the Barrier 
score distribution to yield a high- and a low-Barrier 
group. It was found that these two groups each had a 
Tesponse total of 25. Although the range of responses 
was 15-25, actually only a few cases in each group fell 
below 25, so that the median in both cases corresponded 
to the upper limit of possible response totals. Since the 
medians were the same, it may be concluded that there 
was no significant difference between high- and low- 
Barrier Ss with respect to R. 

Intelligence was a second factor to be considered in 
addition to R as possibly affecting the production of 
Barrier scores. The median IQ for the high-Barrier 
group was 114 and for the low-Barrier group 113.5. 
Accordingly, it was assumed that intelligence did not 
contribute to the difference in Barrier scores between 
the two groups. 


RESULTS 


Testing the hypothesis, which asserted a 
relationship between body-image boundary 
and adjustment to illness, required a compari- 
son of “satisfactory” and “unsatisfactory” 
groups with respect to the frequency of Bar- 
rier scores produced. As shown in Table 2, the 
median Barrier score for patients in subgroups 
characterized by the more satisfactory adjust- 
ment seemed to exceed that of patients who 
made less satisfactory adjustments. Within 
each of the two samples (Groups I and II), chi 
square was used as the test of significance m 
comparing “satisfactory” and “unsatisfactory 
patient groups with regard to the number of 
Ss above and below the median of the Barrier 
score distribution. The Barrier score frequency 


TABLE 1 
HAVING “SATISFACTORY” AND 


” NT 
AND OF THE Tone reel “UNSATISFACTORY” ADJUSTME: 
Group and Adjust- Sex Age Educati - statis Tat 
sh es os N ucation W-B Verbal IQ Testing* 
M | F | Mdn Range | Mdn Range | Mdn | Range | Mdn Range 
Stobp I 
atisfactory 15 6| 9| 26 20~ 
a A Aa i 34 | 12 | 9-17 | 113 | 76-129 3 | 1-14 
Gamp 7 3) 6] 7| 26 | Gous | 32 9-17 | 113 | 72-132 8 | 1-47 
Satisfactory 14 | 10] 4 25 18-38 
4 15 | 3-17 | 11 = -14 
Unsatisfactory Ae | P) | on | aa | ae 6-20 119 HEE ú 48 
Total group 56 |29| 27| 26 | 15-43 | a3 3-20 | 114 | 72-145 7 | 1-18 
* By Rorschach test. 
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joe TABLE 2 
ieee AND RANGE OF BARRIER RESPONSES FOR 
i JECTS CLASSIFIED AS HAVING “SATISFACTORY” 
ND “UNSATISFACTORY” ADJUSTMENT AND FOR 
THE TOTAL Group 
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TABLE 3 
COMPARISON OF PATIENTS CLASSIFIED As HAVING 
“SATISFACTORY” AND “UNSATISFACTORY” 
ADJUSTMENT ON FREQUENCY OF 
BARRIER RESPONSES 


Group and Category Median Range 

Group I o 
Satisfactory 4 1-7 

nsatisf, 

Ga ts ‘actory 2 1-7 
Satisfactory 4.5 0-8 
Unsatisfactory 2.5 1-5 

Total group 3 0-8 


was distributed in categories of 0-3 and 4 or 
more responses. Correction for continuity was 
— in both analyses. Results of these proce- 

res are shown in Table 3. 
, Examination of Table 3 reveals that the 
incidence of Barrier scores in both Groups I 
and II was in the predicted direction with pa- 
tients in the “satisfactory” categories showing 
the greater frequency. In the case of Group I, 
the difference in Barrier score frequency 
between the “satisfactory” and “unsatis- 
factory” categories was significant at the .11 
level and for Group II at the .06 level of 
confidence. Since the assumption of indepen- 
dence of experimental outcomes was satisfied, 
the two probabilities were combined to yield a 
composite chi square (8, pp. 46-47). The 
probability associated with this chi square 
would be the aggregate probability of obtain- 
ing the given set of results from two inde- 
pendent tests of the relationship between the 
adjustment and body-image variables. The 
composite chi square was found to be 10.06. 
With four degrees of freedom (twice the num- 
ber of probabilities involved), this value for 
chi square falls between the five and the 
two per cent levels of significance. Thus, al- 
though results of the two independent analyses 
did not prove to be strictly significant, the 
collective results support the hypothesis. In 
other words, it would appear that people who 
show evidence of definite body-image bound- 
aries, in comparison with those having 
indefinite boundaries, tend to adjust sig- 
nificantly better to the experience of physical 
disability resulting from poliomyelitis. 

The question remains of whether the Barrier 
Score reflects a relatively well established body- 
image factor in the personality. or is simply 


Satisfactory | Unsatisfactory 
Group A t 
Above | Below | Above | Below 
Mdn | Mdn | Mdn | Mdn 
$ 9 6 3 10 | 2.58 11 
pag 9 5 3 11 |3.65| .06 


a function of the degree of physical incapacita- 
tion or duration of illness at the time of testing. 
Three steps were taken in examining this 
problem. The chief physician was asked to 
pick from the total group the 15 patients whom 
he considered to be the most incapacitated 
from the physical standpoint, and the 15 
patients with the least physical involvement. 
This procedure was used because accurate 
ratings on degree of physical disability could 
be made only for extremes. Chi square was 
used as before in comparing the two groups 
with respect to the number of individuals with 
Barrier response totals above and below the 
median of three. The results showed no differ- 
ence in Barrier scores for groups characterized 
by different degrees of physical involvement. 

As a second step, the relationship of Barrier 
scores to duration of illness at time of testing 
was studied in the total group by means of 
chi square. It was found that length of illness 
at time of testing for high-Barrier Ss did not 
differ significantly from the duration of illness 
at time tested for low-Barrier Ss. Median 
durations for the high- and low-Barrier groups 
were 7 and 8 months, respectively. It should 
also be noted that Fisher and Cleveland, in a 
previous study (2), found no relationship 
between Barrier score and duration of illness 
in a group with rheumatoid arthritis and a 
group with neurodermatoses. 

These observations are consistent with the 
argument advanced here, but it is evident from 
Table 1 that both of the groups rated “un- 
satisfactory” had been ill for a considerably 
longer period of time when tested than those 
rated “satisfactory” on adjustment. This 
raised the question of whether the significant 
relationship between Barrier score and adjust- 
ment might be contaminated by the variable 
of duration of illness. In order to control the 
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possible influence of this variable, a matched 
pair technique was utilized as a third step in 
the analysis. 

It was possible to select from the total 
sample a group of 15 “satisfactory” Ss, each 
of whom was closely matched (within one 
month) with respect to duration of illness by a 
patient among the “unsatisfactory” Ss. The 
median duration at time of testing was 7.5 
months, and the range was 1 to 14 months for 
each of the two groups thus obtained. Chi 
square, corrected for continuity, was used to 
compare the two groups with regard to the 
number of Ss above and below the Barrier 
score median. With length of illness at time 
tested held constant, high-Barrier individuals 
still fell more frequently in the “satisfactory” 
category (.06 level of confidence). This result 
did not quite reach the conventional criterion 
for statistical significance, but it was regarded 
as a fair indication that the Barrier score is not 
(or at least not primarily) a function of how 
long the patient had been ill when he was 
tested. 

It might be noted, incidentally, in speculat- 
ing as to why the more poorly adjusted patient 
was tested at a later date in his hospitalization 
than someone with a better adaptation to his 
illness, that the most plausible explanation is 
that the patient who made a poor adjustment 
was simply a more disagreeable and more 
difficult individual to work with and was likely 
to be resistant to testing. There was probably a 
tendency to delay and procrastinate in seeing 
this patient, especially since it was known in 
most cases that he would be available for 
testing over a period of several months of 
hospitalization. 


Discussion 


The results confirm the major hypothesis 
involved in this study and also furnish evidence 
that would seem to make the argument ad- 
vanced for the nature of the Barrier score a 
tenable point of view. The correlation found to 
exist between the body-image variable and 
adjustment seems to indicate that the extent to 
which the individual conceives of his body 
exterior as having a defensive, shielding, 
barrier-like quality plays some part in the 
total constellation of forces affecting the 
adequacy of the individual’s adaptation to 


physical crippling. Apparently, the individual 
whose body image includes these qualities is 
better able to cope with the problems imposed 
by his handicap than is the person whose body 
concept lacks these characteristics, 

It is readily evident, in the light of the 
considerable degree of overlap in Barrier score 
frequently between “satisfactory” and “un- 
satisfactory” groups, that the body-image 
variable, at least as presently defined, possesses 
little diagnostic or predictive efficiency. Never- 
theless, the obtained relationships suggest 
that the concepts involved are theoretically 
fruitful and merit further inquiry. 

In general, the Barrier score seems to reflect 
some kind of a body-image factor in the 
personality tending to dispose the individual 
toward a favorable or unfavorable adjustment 
to physical handicap. This study, of course, 
made no direct test of thiscontention, and such 
a test would be difficult to make. However, if 
contemporary circumstances affect the produc- 
tion of Barrier responses, it would seem that 
the duration and severity of illness would 
have influenced the Barrier score in the group 
studied. Since the results indicate that the 
Barrier response is probably independent of 
these variables, the overall observations at 
least seem congruent with the idea that the 
Barrier score represents a personality factor 
of a fairly basic nature. 

The correlation found between adjustment 
and production of Barrier responses is quite 
consistent with results obtained by Fisher and 
Cleveland in a study of relationships between 
the Barrier score and certain personality 
variables (3). They found that the Barrier score 
in normals is related to such characteristics 
as greater participation in active sports, 
higher » Achievement scores, the production of 
TAT themes involving identification with 4 
more definite image of the parents, and more 
emphasis on task completion. They concluded 
that emphasis on the Barrier aspect of body- 
image boundary is “a reflection of a style of 
life based on an unusually strong definition of 
self-identity and on objective self-expressio® 
aimed at establishing a stable, controlling 
relationship to the environment.” This implies 
a degree of self-confidence that should serve 2% 
a bulwark against a threat to self-integration- 
The individual possessing these characteristics 
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oe should be better able to absorb the 
co to the self involved in physical loss or 
ie Ing and to integrate this loss into a 
vised self concept with the minimum of 
Psychic trauma. 
6 ba le be noted that the body-image 
Clippled, already proved meaningful in another 
tion of ] group with respect to the incorpora- 
and CI ae Inan unpublished study by F isher 
ine i and on data supplied by Haber (7), 
den ound that high-Barrier amputees dealt 
fete realistically with phantom limb ex- 
might ces than did low-Barrier amputees. One 
igh-B conceptualize this as indicating that 
grate ae people were better able to inte- 
kareh body mutilation into their body- 
chema. 
wee experimental observations lend sup- 
eins < the theoretical significance generally 
divid e oes the body image as a factor in an in- 
Would S adaptation to physical handicap. It 
individ therefore, that the study of the 
bromia attitude toward his body is a 
adine way of approaching the problem of 
ships h ent to disability. Meaningful relation- 
ment an. been demonstrated between adjust- 
erha nd the boundary aspect of body image. 
ivete it would be rewarding to extend 
into gations concerning physical handicap 
other dimensions of the body concept. 


SUMMARY 


Ma study has shown, in a group of 56 
Psych X elitis patients, a relationship between 
an Ological adjustment to physical handicap 
md a Measure of the boundary aspect of body 
This measure, derived from the 
protocol, is known as the Barrier 
Of thaj. PParently, individuals who conceive 

“Ir body boundaries as possessing defen- 
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sive, armoring, barrier-like qualities are 


significantly better able to adapt to the cir- 
cumstances surrounding physical loss than are 
those individuals whose body concept does not 
contain these features. The investigation of 
other dimensions of body image is suggested 
as a promising approach to the study of 


adjustment to handicap. 
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THE MANIFEST ANXIETY SCALE AS A MEASURE OF DRIVE 


ROBERT E. SILVERMAN 
New York University 


=£ Taylor Manifest Anxiety Scale 

(MAS) (16, 17) was designed to provide 

a technique for distinguishing drive level 
in humans in order to test aspects of Hullian 
learning theory. It was assumed that the scale 
measured differences in emotional responsive- 
ness which contribute to the total effective 
drive state. According to Taylor (17), the title 
of the test should not be construed as suggest- 
ing special clinical validity since the scale was 
not designed for clinical purposes. In addition, 
Taylor and her associates have not been con- 
cerned with the relationship of anxiety per se 
to performance. 

In spite of the original purpose of the MAS, 
many investigators have sought to relate the 
scale to clinical criteria of anxiety (3, 6, 8, 12); 
others have persisted in attempting to relate 
anxiety to performance variables G,.2,.5, 11, 
15, 18). Thus, many investigations owe their 
origins to Taylor’s recent work and to Hullian 
theory. If these many studies have not fur- 
nished clear answers, they have served an 
heuristic purpose in highlighting the following 
questions: (a) What are the relationships 
between drive level and performance? (b) Does 
anxiety intensify effective drive level? (c) Does 
the Taylor MAS identify anxiety? (d) Does 
the Taylor MAS identify differences in drive 
level? 

The first question continues to be exten- 
sively treated by students of Hullian theory, 
while the second and third questions await an 
unequivocal criterion of anxiety. The purpose 
of the present study was to consider the fourth 
question directly since the assumed relation- 
ship between the MAS and drive has been the 
focal point of a variety of recent investigations, 

If one is willing to argue that drive may be 
defined solely in terms of the MAS, then the 
question becomes superfluous. However, it 
would seem that the intention of the Iowa 
group was to relate general drive variables to 
performance, not simply MAS scores to per- 
formance. In this light, the question becomes: 

What are the relationships between the MAS 
and measures of emotional responsiveness? If 
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the MAS is a measure of emotionality, then, 
according to Taylor, it should be related either 
to a “chronic emotional state” in which sub- 
jects (Ss) demonstrate high levels of emotion- 
ality or to differences in the potentiality for 
arousal, manifested by different degrees of 
emotional reactions to novel or threatening 
situations. r 

Palmar skin conductance and conductance 
changes were employed as measures of emo- 
tional responsiveness in the research to be 
reported. This technique, recently described by 
Schlosberg (14), has the multiple advantages 
of simplicity, reliability, and accuracy in esti- 
mating general states of autonomic nervous 
system activity.! 

Because two recent investigations have 
failed to demonstrate a linear relationship 
between the MAS and the GSR or between the 
MAS and palmar perspiration technique (1, 9); 
steps were taken to explore the conditions that 
might obscure such relationships by employing 
two psychometric inventories. The two inven- 
tories were the MAS and a forced-choice 
version of the MAS designed by Heineman 
(7). The forced-choice inventory was include 
to provide some control over one of the know? 
sources of error in the MAS, the influence ° 
social desirability variables (7, 12). 


METHOD 
Subjects 


a 
The Ss were 85 male undergraduate students aes 
the elementary psychology course at the Unive a 
College of Arts and Sciences at New York Universi 
The Ss were administered an inventory containing ot 
Taylor MAS and the K scale from the Minnes e 
Multiphasic Personality Inventory as a part of jol 
weekly small group discussions held in conjuncti 
with the lecture course. The skin conductance reading i 
were taken in a laboratory setting bearing no po 
temporal, or otherwise obvious relationship to the te ry 
In all, 66 Ss participated in the MAS and the laborat 


d, inde 
1A study by the author, recently completed, on 


cates weck-to-week reliability coefficients for the a 
ductance measures ranging from .68 to .74. In additi 
conductance changes were found to be systematica 
related to shock stress. 
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phases. Six weeks after the MAS was administered, the 
Heineman forced-choice version of the original Taylor 
scale (7) was given again in the discussion meetings, 
this time to 77 of the original Ss? The attrition of Ss 
was due mainly to class absences and failure to keep 
appointments. 


Apparatus and Procedure 


mang Sepparalus for measuring skin conductance con- 
a 50 microampere meter in series with two 
quarter-sized _silver-silver, chloride-saline paste elec- 
odes This is essentially the unit outlined by Schlos- 
E (14). The circuit was run by a single one-and-one- 
alf volt flashlight battery which, with the aid of a 
Potential divider and a calibrated resistor, permitted 
absolute conductance to be read directly. In order to 
insure consistent electrode placement from S to S, and 
to maintain optimal contact during the experimental 
ca the electrodes were attached to S by having 
him grasp a cylindrical two-inch wooden bar on which 
the electrodes were seated in rubber cups. A second 
padded bar was then firmly clamped over each hand. 
Thus, S was in a kind of hand stock which permitted 
finger movements without disturbing the electrode 
contact. 

Each S was introduced into the experimental setting 
by being placed in the stock and instructed to relax 
and rest while the experimenter (E) recorded his skin 
temperature. (This subterfuge was employed in order 
to obviate any concern on S’s part with electric cur- 
Tents and possible shock.) During this rest period of 
ten minutes, Æ recorded two conductance levels every 
30 seconds, one from each electrode. This was done to 
insure the obtaining of the optimum conductance read- 
ing. Only the highest reading from each pair of read- 
ings was used in computing the mean conductance 
level for a given S. 

Following this rest pe 
perform a simple series o. 


addition problems presented 
in front of them. One group of Ss (N = 36), selected 


at random, was assigned to the shock-threat condition. 
They were fitted with an arm band containing two 
electrodes and were instructed that, 
During the next few minutes you are going to be 
shown some numbers on this screen. You are to add 
the numbers as quickly as possible... but also as 
as you can. If you are not performing 
a shock . . . something like this. 
(A sample shock from an Applegate Constant Current 
Simulator set to deliver 2 milliamperes was given 
here.) The no-shock Ss (N = 30) were given the same 
task instructions but were not fitted with the electrodes 
and were not threatened with shock. 

The E recorded two conductance readings immedi- 
ately preceding the presentation of each problem. The 
entire problem series lasted 1214 minutes. 


riod, all Ss were instructed to 
f tasks consisting of 25 simple 
on a small screen directly 


accurately 
well, you will receive 


RESULTS AND DISCUSSION 


The mean conductance scores for the rest 
periods were transformed into logarithmic 


2Heineman’s FC-1 was used, scored according to 
Key 2 as described in the original report (7). 
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units for the purpose of statistical compari- 
sons.’ The shock and no-shock task conditions 
provided measures of arousal which were com- 
puted in terms of the changes in log conduct- 
ance from the rest periods to the task periods. 

The correlations of the MAS scale and the 
forced-choice scale (FC) with the log conduct- 
ance readings are shown in Table 1. None of the 
MAS correlations approaches significance. 
However, the FC scale does show a reliable if 
slight relationship with the rest period readings 
(r = .24) and a clear negative relationship 
with the conductance changes under the threat 
of shock (r = —.50). This negative relation- 
ship indicates that high scores on the FC scale 
are associated with low degrees of arousal. 

An explanation for the fact that the two 
scales, the MAS and the FC, consist of similar 
items but do not correlate similarly with the 
conductance measures, may be found in Table 
2 where the relationships among the MAS, 
the FC, and the K scales are shown. The K 
scale was designed to be a measure of “test- 
taking attitude” (10). High K scores are 
thought to be associated with a type of defen- 
siveness against psychological weakness, while 
low K scores may indicate excessive self- 
criticism. Thus, the correlation of —.55 
between the MAS and the K scale may be an 
indication that some low MAS scores were 
associated with defensiveness and some high 
MAS scores were simply reflecting a too candid 
or critical attitude. The FC scale, although 
correlating reliably with the MAS, did not 
show a relationship with the K scale. 

These findings suggest that the failure to 
demonstrate a correlation between the MAS 
and the conductance measures may have"been 
due to the spurious nature of some of the MAS 
scores. When steps were taken to minimize the 
effects of the test-taking attitude by employing 
a forced-choice technique, a relationship was 
found between the questionnaire and the 
physiological measure. Thus, an inventory like 
the Heineman scale appears to be a more 
sensitive measure of emotional responsiveness 
than the MAS. 

It was observed above that conductance 


3 In addition to being normally distributed, log con- 
ductance changes have been found to correlate well 
with measures of palmar perspiration. Mowrer et al. 
(13) reported a relationship between palmar perspira- 
tion, measured by a colorimetric technique, and meas- 
ures of “psychological” tension during psychotherapy. 
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TABLE 1 


CORRELATIONS BETWEEN THE Two SCALES AND LOG 
CONDUCTANCE READINGS 


Mas | FC 
| £ SE r SE 
4 11 
Conductance: rest -07| .11 -24 g 
Conductance change: no |—.14| .18 | 12 -20 
hock z 
Gontuctancs change: == 17| 316 |- 50**| .14 
shock | E 
* Significant at the .05 level. 
** Significant at the .01 level. 
TABLE 2 
INTERCORRELATIONS AMONG THE THREE SCALES 
FC K 
MAS a 
: (N= 69) 
FC — 


** Significant at the .01 level. 


change is inversely related to the FC scale 
under the shock threat condition. This suggests 
that persons defined as anxious by the scale are 
less aroused by a threatening situation. These 
findings are in direct contradiction of the 
hypothesis that high levels of emotional re- 
sponsiveness are associated with increased 
potentialities for arousal. Nevertheless, an 
interesting and not unfamiliar relationship is 
indicated which points up possible shortcom- 
ings in the conception of anxiety as a drive. 
Silverman and Blitz (15), using rote learning 
measures, found that high anxious Ss (defined 
by MAS) were less affected by threat condi- 
tions than their low anxious controls. Deese, 
Lazarus, and Keenan (5) similarly reported 
that the rote learning of high anxious (MAS) 
Ss was not reliably affected by shock stress, In 
both of these Studies, the dependent variable, 
rote learning, did not offer any clue in regard to 
other behaviors possibly associated with threat 
and related to rote learning. 

Yet it is unlikely that anxious persons are 
generally unresponsive to threat Situations. It 
may be that the threat conditions of the labora- 
tory are relatively benign by comparison to the 
personal doubts and fears which anxious per- 
sons habitually carry with them. Expressed in 
terms of this study, one might say that the 
tensions engendered by the laboratory threat 


did not raise an already high state of emotion- 
ality. A more general conception is that in- 
creases in arousal depend upon the already 
existing state of tension. This conjecture is 
supported by the fact that in the present study 
the amount of conductance change was a func- 
tion of conductance level during rest. The 
product-moment correlation between rest level 
and change was .63. Darrow (4) also found that 
the size of the autonomic reaction (skin resist- 
ance) is a function of the amount of “available 
energy” which a given individual has at his 
disposal. Thus, it would seem that persons 
whose tension levels are high initially do not 
show much arousal and, hence, are unable to 


cope with novel situations which demand 
increases in arousal, 


SUMMARY 


This study was concerned with the relation- 
ship between the Taylor MAS and skin con- 
conductance. A group of Ss was given the MAS 
and six weeks later the Heinman forced-choice 
version of the scale. Skin conductance meas- 
ures were obtained for each S under two condi- 
tions: (a) a rest period, and (b) a task period 
involving shock threat for some Ss, and no 
shock threat for others. , 

The results indicated that the forced-choice 
scale was positively related to the readings 
taken under rest condition, and negatively 
related to changes in conductance obtained 
under the threat of shock. The MAS did not 
correlate with any of the conductance read- 
ings. . 

These findings were explained by considering 
the high correlation between the K scale and 
the MAS and by noting that anxiety has been 
found by other investigators to be associated 
with low arousal. 
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GICAL MEANING OF ACQUIESCENCE SET FOR 
ee AUTHORITARIANISM? 


N. L. GAGE, GEORGE S. LEAVITT, anp GEORGE C. STONE? 


Bureau of Educational Research, University of Illinois 


HIS paper is directed against the argu- 
ment that the California F (predisposi- 
tion to fascism) scale (1) and similar 
inventories have their validity lowered by 
their loading with acquiescence set. The 
argument runs as follows: (a) the F scale 
consists entirely of “negative” statements, 
i.e., statements that, when agreed with, reflect 
authoritarian attitudes; (b) the tendency to 
agree with ambiguous and unstructured state- 
ments has high internal consistency and 
generality over tests of markedly different 
content (7); (c) the F scale, and other in- 
ventories of attitudes about social issues and 
interpersonal relationships which employ agree- 
disagree items, are significantly loaded with 
irrelevant variance in acquiescence tendency. 
Thus, Cohn (5) found the F scale signif- 
icantly correlated (.41) with his Plus scale; the 
latter measures the number of times one 
responds “true” to 33 MMPI items dealing, 
according to Cohn, with nothing in particular. 
Similarly, Bass (4) found the F scale cor- 
related positively with an acquiescence factor 
obtained by factor analysis. 


A CORROBORATIVE STUDY 


The present authors have obtained similar 
results in an independent study. We used 
Cohn’s Plus Scale, the F scale, the Minnesota 
Teacher Attitude Inventory (MTAI) (6), and 
an especially constructed “Information-True” 
test of acquiescence. These were administered 
to 118 graduate students in education at the 
University of Illinois. 

The Information-True test consisted of 50 
“difficult” true-false information items. These 
items had been written and found through pre- 
testing to be sufficiently difficult and obscure to 
elicit approximately a 50-50 split of “true” and 
“false” responses. The difficult items were 


1 This investigation was supported in part by a re- 
search grant (M-650) from the Institute of Mental 
Health, National Institutes of Health, Public Health 
Service. 

2 Now at Fresno State College. 

3 Now at G. D. Searle & Co., Skokie, Illinois. 
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mixed with 40 “easy” items; the latter were 
written and found through pretesting to be 
at about the 75 per cent level of difficulty. (The 
corrected split-half reliability of the number- 
correct scores on the easy items was .35; the 
latter score served as a crude measure of 
information about history, science, civics, 
Gallup Poll data, geography, and the like.) 

The corrected split-half reliability of the 
acquiescence score, number of “true” responses 
on the difficult information items, was .68- 
This z indicated that the test was successful 
in eliciting reliable individual differences in 
tendency to respond “true,” or acquiescence 
set. That difficult and ambiguous items are 
required to elicit the acquiescence set is again 
demonstrated by the fact that the reliability of 
the acquiescence score on the easy information 
items was only .09. 

Table 1 shows the correlations among the 
Cohn Plus Scale, the Information-True scores 
the MTAI, and the F scale. These 7s Cor- 
roborate previous findings. The significant 
positive correlation (.19) between our cme’ 
tion-True score for acquiescence and Cohr 
Plus Scale shows that acquiescence tendency ? 
indeed general over considerably different yea 
of test content. The correlation probably wou: 
have been higher if Cohn’s Plus Scale wer? 
not concerned with another variable, namely 
tendency to self-disparagement. That is, oy 
most of Cohn’s 33 MMPI items, the “true 
response seems also to be the unfavorable 
description of oneself. This impression a 
supported by the judgments of two gradua i 
students and two clerical workers who We, 
asked to indicate whether “true” or “false 
would constitute an unfavorable self-descrP” 
tion in response to the 33 items. Their 12 
pendent judgments agreed unanimously tha 
“true” was the self-disparaging response 
almost all the items. d 

Table 1 also shows that the Plus scale 20° 
the Information-True score correlate neg” 
tively (—.29 and —.26, respectively) with T 
MTAL. These negative rs were expected on t 
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TABLE 1 
CORRELATIONS AMONG ACQUIESCENCE, MTAI, AND 
F SCALE 
(N = 118) 
Measure Mean] SD | ree rs 
1. Acquiescence: o @) | @ 
Plus Scale 12.7 | 5.1 | .76t |.19°|—.29°*] .44°°* 
2. Acquiescence: 
Information-True | 27.2 | 5.7 -68f —.26°*| .35°** 
3. MTAI 53.8 | 32.4 | .86f —.62°°* 
4. F scale 49.8 | 23.8 | -83t 


* Significant at the .05 level. 

** Significant at the .01 level. 
*** Significant at the .001 level. 
t Corrected split-half. 

t Kuder-Richardson formula 20. 


grounds that (a) the acquiescence tendency is 
general over highly diversified types of test 
content, and (b) the MTAI scoring weights are 
negatively correlated with degree of agreement 
to its items. The latter point is apparent from 
the published scoring keys for the MTAI. Of 
the 274 +1 scoring weights in the MTAI key, 
60 per cent are assigned to “disagree” and 
“disagree strongly” responses, while only 21 
per cent are assigned to “agree” and “agree 
Strongly” responses. Hence, if acquiescence set 
is general over tests, acquiescent subjects 
should get low MTAI scores. Our two signif- 
icant negative rs support this reasoning. 
Table 1 further shows that both acquiescence 
tests correlate positively with the F scale. The 
r between the Plus scale and the F scale is 44; 
that between the Jnformation-True score and 
the F scale is .35. These positive 7s were 
expected by the same reasoning as that given 
for the negative rs between acquiescence and 
the MTAI. On the F scale, however, the 
scoring weights correlate positively (+1.00) 
with degree of agreement on the items. Our 
positive 7s confirm those of Cohn and Bass. 
The final r in Table 1 is that between the 
MTAT and the F scale. This r should be nega- 
tive because both tests are affected by ac- 
quiescence tendency but score it in opposite 
directions, and because the content of the two 
tests is fairly similar. The obtained r was — 62. 


PSYCHOLOGICAL SIGNIFICANCE OF 
ACQUIESCENCE 


So far, our data merely support previous 
iescence set’s 


findings concerning the acqui 


internal consistency, generality, and effect on 
test intercorrelations. At this point, however, 
we can choose between two interpretations of 
these findings. That is, granted that the 
California F scale and the MTAI are loaded 
with acquiescence set, we can Say either (a) 
that acquiescence is psychologically irrelevant 
to the authoritarian attitude measured by the 
F scale and the MTAI, or (b) that acquiescence 
is part and parcel of the authoritarian attitude. 
The latter position, developed by H. J. Leavitt 
and others (13), is that which we shall adopt, 
extend, and further test in the remainder of 
this study. 

Not only the MTAI and the F scale are 
involved in this issue. As Bass (4) has indi- 
cated, measures of many other attitudes are 
implicated. These measures involve attitudes 
about social relations, morals, prejudice, 
custom, and status relations; examples are 
“dogmatism,” “misanthropy,” “xenophobia,” 


and “suggestibility.” 


Positive and Negative Items 


The act of agreeing. Let us assume that 
acquiescence tendency reflects a genuine 
aspect of the authoritarian disposition. Then 
h a statement should, other things 
reflect authoritarianism. But 
suppose the statement agreed with is one that 
expresses a nonauthoritarian attitude, ie, 
is a “positive” statement. Examples of positive 


statements can be found in Bass’s G scale, and 


in Ancona’s $ Scale (2); both of these were 


made up of items opposite in meaning to the 
original statements of the F scale. Examples of 
these items are the following: “The artist and 
the professor are much more important to 
society than the businessman and the manu- 
facturer.” “Human nature being what it is, 
universal peace will come about eventually.” 

The MTAI contains 38 such “positive” 
items, where agreement may be considered to 
reflect a nonauthoritarian attitude. Examples 
from the MTAI are: “Minor disciplinary 
situations should sometimes be turned into 
jokes.” “Unquestioning obedience in a child is 
not desirable.” “There is too great an emphasis 
upon ‘keeping order’ in the classroom.” 

As shown in Table 1, the act of agreeing to 
positive items can occur either (a) “logically,” 


agreeing wit 
being equal, 
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TABLE 2 
HYPOTHESIZED EQUIVOCALITY OF RESPONSE TO POSITIVE 
ITEMS AND UNIVOCALITY OF RESPONSE TO 
NEGATIVE ITEMS 


Response 
Type of 
i Agree Disagree 
Logical Meaning: Non- | Logical Meaning: Au- 
authoritarian | thoritarian 
Rosie: | pevchologicel Meadiue: | Peyckolorieal Meaning: 
Authoritarian | Nonauthoritarian 
Logical Meaning: Au- | Logical Meaning: Non- 
thoritarian authoritarian 
Negative’ | psychological Meaning: Psychological Meaning: 
Authoritarian | Nonauthoritarian 


because the person genuinely has the non- 
authoritarian attitude which the content of the 
item represents, or (b) “psychologically,” be- 
cause he tends to acquiesce. Now, if acquies- 
cence reflects obeisance to authority, i.e., a 
form of authoritarianism, then reasons a and b 
have opposite psychological significance. The 
“logical?” and “psychological” reasons for 
agreeing are in a sense opposed to each other. 
Response to a nonauthoritarian statement may 
therefore be considered to have equivocal 


meaning for the personality of the subject 
involved. 


“logically” 
and “psychologically,” Examples of negative 
items are all those of the F scale, such as: 


classes, the weak and the strong.” “Familiarity 
breeds contempt.” “War and social troubles 
may someday be ended by earthquake or 
flood that will destroy the whole world.” 

There are 112 such negative items in the 
MTAI: “If the teacher laughs with the class in 
amusing classroom Situations, the class tends 
to get out of control.” “4 child should be 
taught to obey an adult without question.” 
“Discipline in the modern School is not as 
strict as it should be.” 

The act of disagreeing. On Positive items, 
disagreeing may occur (c) “logically,” because 
the person is authoritarian and so genuinely 
rejects the nonauthoritarian attitude which 
the content of the statement represents, or (d) 
“psychologically,” because he is nonauthori- 
tarian and so does not tend to acquiesce ir- 
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rationally when confronted with a difficult or 
ambiguous statement. Here the person does 
not have the kind of obeisance to authority of 
the printed word which may be considered to 
reflect an aspect of the authoritarian personal- 
ity. The nonauthoritarian is not negativistic; 
he simply does not agree as often on merely 
temperamental grounds, and his alternative 
therefore is to disagree more often. For positive 
items, again, reasons c and d have opposite 
significance. 

On negative items, however, the act of 
disagreeing has the same significance for 
authoritarianism whether it is a genuine 
response to item content or a temperamental 
response representing autonomy, nonyielding, 
and the like. In short, agreeing or disagreeing 
with negative items has relatively univocal 
meaning for the authoritarianism of the person 
being tested. 

Hypothesis. By this reasoning, we arrive at 
the hypothesis that negative items have greater 
validity than positive items for the measurement of 
authoritarianism. Psychologically, disagreeing 
requires more “ego strength,” “emotional 
security,” and “self-confidence,” than does the 
act of agreeing. Coupled with the logical signif- 
icance of a nonauthoritarian response, i.e., 0 
disagreeing with a negative statement, the 
meaning of the response is clearly nonauthor!- 
tarianism. Psychologically, agreeing may mean 
yielding to the authority of the printed word oF 
a similar submission to the power and plaust- 
bility of the social context in which the test 15 
given. Coupled with the logical significance of & 
nonauthoritarian response, i.e., of agreeing 
with a positive Statement, the resultant mean- 
ing of the response is relatively unclear; it may 
mean either authoritarianism or nonauthorl- 
tarianism. When both positive and negative 
items are similar in content and degree a 
ambiguity, the latter items should have highe 
validity for measurement of authoritarianism- 


Method and Results 


We have tested this hypothesis by correlating ae 
thoritarianism scores obtained separately with Fi 5 
positive and negative items of the MTAI, with rating? 
of teachers by their pupils. To test our hypothesis ee 
quires an external criterion of authoritarianism, unco a 
founded with the subject’s acquiescence. As such rs 
measure, we used pupils’ mean ratings of their teuche 
on Leeds’ “My Teacher” rating scale. This scale eri 
sists of 50 Yes—No—? items of the following tyP 
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“Ts this teacher often ‘bossy’? Does this teacher force 
her ideas on the pupils? Is this teacher usually fair 
with the pupils? Does this teacher think she is always 
right and the pupils always wrong?” In previous 
studies (6), pupils’ ratings of teachers on this scale 
have been found highly internally consistent over 
items and highly discriminating between teachers. 
Further, these mean ratings by pupils have been sig- 
nificantly correlated with experts’ and principals’ rat- 
ings of teachers as to their ability to maintain disci- 
pline, create a friendly classroom atmosphere, establish 
a feeling of security on the part of pupils, exert a 
stabilizing influence on the class, and develop the pu- 
pils’ self-reliance. It seems defensible to use these rat- 
ings as a criterion of the authoritarian personality 
trends of the type at which the F scale and the MTAI 
are aimed. The authors of the MTAI cite The Authori- 
larian Personality as an influence on their description of 
the adjustment mechanisms responsible for undesirable 
teacher attitudes. 

The subjects in this study were 97 of the 98 teachers 
of grades 4-6 in a midwestern city (10). The criterion 
ratings of the teachers were furnished by their 2800 
pupils. The reliability of the mean rating of teachers 
was .91, as estimated with Horst’s formula (11). 

Table 3 shows the correlations of the scores on the 
Positive and negative MTAI items with the pupils’ 
ratings of the teachers. The r of the 38 positive items 
was .13. The 112 negative items were divided, by means 
of a table of random numbers, into three sets of 38, 38, 
and 36. This procedure made the scores on negative 
items comparable as to length of test with the scores 
based on positive items. These three sets of negative 
items correlated .30, .22, and .22 with the pupils’ 
ratings. The mean r was .25. 

The 4 value of the difference between rs of .13 and 
.25, using Hotelling’s formula for the ¢ of a difference 
between rig and ris (12, p. 278), is 1.64. Since the direc- 
tion of the difference was predicted, a one-tail level of 
significance is justifiable; hence, the difference is almost 
significant at the .05 level. This test of significance did 
not take account of the fact that the value of r of .25 
is a mean of three rs and hence is more reliable than a 
single r. 

When all 112 negative MTAI items were combined, 
the score on them correlated .31 with the pupils’ rat- 


TABLE 3 
CorrELATIONS OF Two Types or MTAI Score 
witt Puprts’ RATINGS OF TEACHERS 


(N = 97) 

rpm Mep 

No. of ating of 

Type of MTAI Score | Pome | M | SD | ret Teacher by 
Popitive Items 38 |10.3/11.8] -75 13 

egative Items 

Set A 38 |14.8/10.2| -82 .30 
Set B 38 |17.7| 9.3] -75 .22 
Set C 36 |14.2| 8.9] .73 «22 
All items 112 |46.8)26.0) .91 31 
Total Test 150 90 -26 


t Corrected split-half. 
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ings. This r is higher than the r of .26 obtained with 
scores based on the entire 150-item MTAI. Thus, the 
score based only on negative items is more valid than 


the entire inventory. 


DISCUSSION 


The main point of this paper is that acqui- 
escence may belong to the family of disposi- 
tions—including authoritarianism, obeisance to 
authority, and conformity—that have been 
identified in studies employing methods other 
than agree-disagree items. Accordingly, the 
agree-disagree format and high loading with 
negative items of the F scale and the MTAI 
make a positive contribution to the validity of 
these inventories. 

On the basis of this reasoning, we hypoth- 
esized that, on positive items, the “logical” 
ance of the response might be contra- 
dicted by the “psychological” significance of 
the act of agreeing or disagreeing. Hence, such 
items should be less valid than negative items, 
in which the act of agreeing or disagreeing 
represents psychologically the same authori- 
tarianism or nonauthoritarianism as the logical 
significance of the response denotes. Positive 
support for this reasoning was yielded by the 
correlations of the two types of items in the 
MTAI with pupils’ ratings of their teacher on a 


variable akin to authoritarianism. 
Our results resemble those described by 


Rundquist (15, pp. 142-143) and Cronbach 
(7, p. 406). Rundquist cited studies showing 
negative opinion items more valid in discrim- 
inating between men receiving and not receiv- 
ing public relief, between employed and 
unemployed engineers, between problem and 
normal children. Cronbach found that number- 
right scores on False statements on True-False 
achievement tests were more valid than scores 
on True statements against criteria consisting 
of grades on other types of examinations. 
These results can be assimilated to ours only 
if acquiescence is considered to reflect not only 
obeisance and authoritarianism but also malad- 
justment, low morale, and low intelligence. 
Granted such assumptions, we could prepare 
diagrams like Table 2 to show that responses 
to positive items are equivocal in terms of these 
variables, while responses to negative items are 
not. That these assumptions are plausible is 
indicated by some of Crutchfield’s findings: 
“The assessment staff rating on ‘intellectual 


signific 
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competence’ correlates —.63 with conformity 
score, this being the highest relationship of any 
found...An ‘ego strength’ scale... corre- 
lates —.33 and a staff rating on ‘leadership 
ability, —.30 with conformity” (9, p. 194). 
Similarly, Ancona’s (2) finding that acquies- 
cence is inversely related to need achievement 
ld support our assumption. 
eoor of this hypothesis should be 
made. They should employ both kinds of 
items—positive and negative. Scores for 
authoritarianism on both types of items should 
be validated against external criteria of author- 
itarian personality tendencies in which the 
acquiescence set is not involved, Similarly, 
measures of acquiescence should be correlated 
against measures of morale, ego strength, con- 
formity, and intelligence. 

If negative items prove to be more valid for 
the measurement of authoritarian personality 
tendencies, this will have practical implications 
for test construction. Either inventories should 
consist primarily of negative items, as does the 
Present MTAI; or most positive items should 
be unscored; or scores on positive and negative 
items should be obtained separately and 
weighted empirically, 

It appears, with the hindsight provided by 
the present data, that the authors of the F 
scale and the MTAT built better than they 
knew. No mention of the acquiescence variable 
or of its possible kinship with authoritarianism 
is made in the descriptions of either scale’s 
construction. Levinson (1, p. 59) in describing 
the construction of the anti-Semitism scale, 
gave as his reasons for the use of negative 
items (a) they tend to be more discriminating, 
(b) they can be so phrased that they express 
subtle hostility... (c) they are more reason- 
ably used for measuring receptivity to anti- 
Semitic ideology, (d) evidence showed vari- 

ability of response rather than a set to agree or 
disagree consistently, and (e) the scale was 
valid against external criteria, None of this 
reasoning considers the psychological differ- 
ence between agreeing and disagreeing when 
confronted with a difficult or ambiguous item. 
Until now, the argument against using only 
negative items has not been that they will pro- 
duce a uniform set, but rather that they will 
reflect individual differences in such a set. Previ- 
ous writers have shown that this does in fact 
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occur. Our findings show that it not only oc- 
curs, but that it probably contributes posi- 
tively and in a psychologically meaningful way 
to the validity of the F scale. 

The present results also have theoretical 
significance for the nature of acquiescence, 
authoritarianism, and related variables. Our 
results provide some support for the concep- 
tualization of acquiescence as akin to the 
“yielding” or “conformity” to group pressure 
studied by Asch (3) and Crutchfield (9). 
Crutchfield (9, P. 194) reported an r of .39 
between conformity and the F scale. Rudin 
(14) found the F scale correlated about —.35 
with freedom from contextual influence in per- 
ception of both social and nonsocial stimuli. If 
acquiescence does represent a kind of con- 
formity—in this case to the “pressure” of the 
printed word or of the context in which a 
plausible statement is presented—then it does 
Tepresent an aspect of authoritarianism. 

In short, the present study may aid in under- 
Standing the psychological meaning of acqui- 
escence set, apart from its logical and mathe- 
matical significance, Acquiescence has long 
been suspected of being a psychologically 
meaningful variable. It appears that, in meas- 
uring authoritarianism, we do have one of the 
potentialities to which Cronbach referred: 
“Some of the response-set variance is potert 
ally useful, some of it is an interference wit 
measurement. The problem for the tester is to 
capitalize on the effect of response sets where 
they are helpful to validity... .” (8, p. 19). 

Further studies are needed to pin down the 
Proposition that acquiescence set should m 
capitalized on, rather than shunned, in tests © 
authoritarianism, ethnocentrism, teacher atti- 
tudes, and the like. 


SUMMARY 


Previous workers have hypothesized 
found significant correlations between ie 
California F scale and acquiescence set a A 
able individual differences in tendency To 
respond “agree” or “true” when in doubt). wo 
corroborate these findings and extend them 
the Minnesota Teacher Attitude Inventory 
(MTAI), we constructed an information test oa 
acquiescence, consisting of 50 true-false item 3 
about obscure matters. This test, Cohn’s P. F ms 
Scale of acquiescence, the F scale, and t 
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MTATI all correlated significantly with each 
other in directions expected on the basis of the 
acquiescence set. 

The question was then raised, is acquies- 
cence an aspect of authoritarianism such that 
variance due to acquiescence set is psychologi- 
cally relevant to authoritarianism? By assuming 
that acquiescing reflects authoritarian per- 
Sonality tendencies, we can derive the hypoth- 
esis that “positive” items are less valid than 
_hegative” items for measuring authoritarian- 
ism. Teachers’ scores on these two types of 
MTAI items correlated differently, as hypoth- 
esized, with pupils’ ratings of the teachers on 
variables similar to authoritarianism. 

The hypothesis implies that the F scale, the 
MTAI, and similar inventories are advisedly 
Constructed primarily of “negative” items. 
The psychological meaning of acquiescence re- 
sembles that of authoritarianism, conformity, 
low ego strength, and low intelligence. 
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CULTURAL INFLUENCE ON THE 


PERCEPTION OF MOVEMENT: 


THE TRAPEZOIDAL ILLUSION AMONG ZULUS! 
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RADITIONALLY, theories of the visual 
perception of movement—with which 
the present study deals—have been 
divided into two classes: 1. The nalivistic, i.e., 
theories emphasizing the role of retinal and 
cortical functions relatively unaffected by 
learning, habit, experience, or meaning; and 
2. The empiricistic, i.e., theories giving pri- 
mary weight to the role of experience and 
T, 
ua purposes it is essential to subdivide 


empiricistic theories into two groups: 


Cumulative habit: Stressing the effects of many types 
of early, remote, and generalized experience which by 
transfer or cross conditioning become a major deter- 
minant of the perception of movement. Toch and 
Ittelson (22, p. 199) state that “contemporary empiri- 
cism” favors this type of approach, offering its explana- 
tions of perceived movement in terms of “weighted 
averages of experiential sediments of all kinds acting 
inseparably,” 

Object connotation (meanin: 
movement largely in terms of familiar objects. One sees 
continuous wing motion in an electric sign representing 
a bird in flight, although the stimulus actually occurs 
discontinuously in two or in three fixed positions. This 
theory would hold that our familiarity with birds in 
flight causes us to fill the gaps with perceived motion. A 
good statement of this theory of stroboscopic movement 


g): Explaining perceived 
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may be found in James (9), This author insisted that 
“perception is of definite and probable things.” In 
explaining illusions, James leaned heavily upon their 
resemblance to familizyyobjects. In so doing he was 
merely rendering more concrete and specific Helm- 
holtz’s theory of “unconscious inferences” and Wundt’s 
“assimilation” theory. 


At the present time none of these theories 
can be defended in their exclusive purity. F. 
H. Allport (1) makes this fact abundantly 
clear. No empiricist, for example, can deny 
the native physiological substrate of the 
perception of movement nor its structural 
properties as represented by Korte’s laws 
governing the phi-movement (13), Conversely, 
no nativist can deny the contribution that 
past experience may make to perceived move- 
ment. Wertheimer, for instance, admits that 
“it is certainly correct that past experience 
can influence the conditions (Verhältnisse) of 
stroboscopic vision” (23, p. 79). Von Schiller 
(21, p. 195) makes the suggestion—especially 
important for the present research—that 
attitudinal set and expectancy are particularly 
effective in determining the perception © 
movement in ambiguous (alternativ) situations. 

Many authors who have worked with pen 
ceived movement, e.g., Neff (17) and Hall an 
Earle (7), have favored an eclectic view. T aA 
have concluded that native and experientia 
factors both contribute, as do likewise momen- 
tary set and the previous level of adaptation- 
And they allow, among experiential factors, 
for both cumulative habit and object connotation 
(meaning). 

While the eclectic position is no doubt 
correct there is, as Toch and Ittelson insis 
(22), still a fundamental question concerning 
“the relative primacy or importance of learning 
Processes and physiological conditions.” mage 
tists do tend to lean in one direction or t 
other. The distinction between nativists = 
empiricists still exists. Among the former, 0? 
thinks of Wertheimer (23), Kofika a 
Metzger (14), Michotte (15); among A 
latter, of Helmholtz (8), Piéron (19), Ames A 
Wundt (24), Cantril (4), G. E. Müller (16) 
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Kilpatrick (11), and Toch and Ittelson them- 
selves (22). 


THE CROSS-CULTURAL APPROACH 


To gain light on this dispute psychologists 
have often asked, “How about primitive 
peoples?” If we can find a tribe or a culture 
where relevant past experience can be ruled 
out, we could then determine whether the 
perception resembles that of western peoples. 
If it does so, then the argument for nativism 
is presumably stronger. The first extensive 
attempt to apply this test was made by W. H. 
R. Rivers during the Torres Strait expedition 
in 1898 (20). Rivers presented to the island 
natives a whole array of visual illusions and 
compared their reports with western norms. 
For some of the illusions there were no appre- 
ciable differences; for others, the natives 
seemed on the whole less susceptible than 
westerners. While Rivers himself does not 
make the point clearly, his results seem to 
show that illusions involving object connota- 
tion (e.g., a European street scene) are far 
less compelling to the natives than are illusions 
having no such object connotation (e.g., the 
rotating spiral) (20, esp. PP- 130 f.). 

It is not easy for western psychologists to 
visit primitive tribes, nor to conduct among 
them adequately controlled experiments. The 
present article, however, deals with one such 
attempt. But before we describe it, the theo- 
retical point at issue should be made entirely 
clear: We do not claim to be testing the merits of 
the nativist or empiricist positions directly. For 
reasons that will appear in the course of our 
discussion, we do not believe that comparative 
al studies on western and on primitive 
an solve this particular riddle. We 
have illuminated the part played by 
(meaning) in the perception of 
motion as over and against the part played by 
either nativistic determinants or cumulative 
habit, Our experiment is not able to distinguish 
between the role of these last two factors. 


perceptu 
peoples c 
claim only to 
object connotation 


The Rotating Trapezoidal Window 

Before the days of Gestalt psychology it 
was customary to regard visual illusions as 
oddities, as exceptional experiences to be 
accounted for either in terms of nativistic or 
experiential constraints. Today, however, we 
make little distinction between illusions and 
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veridical perceptions, since no illusion lacks 
veridical elements and no veridical perception 
is devoid of subjective shaping. So-called 
illusions are simply instances of perception 
where the discrepancy between impression and 
knowledge (whether the knowledge be the 
subject’s or the experimenter’s) is relatively 
striking. It is in such “looser” conditions of 
perception that theorists often seek to obtain 
light on the relative weight of factors entering 
into the normal perceptual process. The 
reasoning is not unlike that which leads psy- 
chologists to study exaggerated functions in 
psychopathology in order to obtain light on 
the same but less exaggerated functions of the 
normal mind. 

Our experiment follows this logic, making 
use of the rotating trapezoidal window de- 
scribed by Ames (2)—a device that has been 
called “a dramatic masterpiece of ambiguous 
stimulation” (1, p. 276). The window (Figure 
1) is so proportioned that as it rotates, the 


length of the longer edge is always longer on 
the retina than is the shorter edge (even when 
the shorter edge is nearer). The resulting 
perception is normally one of oscillation or 
sway; the observer apparently tending to 
keep the longer edge nearer to him. Instead of 
seeming to rotate, as it actually does, the 
window is seen to sway back and forth in an 
arc of 90 to 180 degrees. 

An appended cube and rod add great interest 
to the illusion, since the perceived rotating of 
these objects conflicts sharply with the per- 
ceived sway of the window. In consequence, 
the cube is usually seen to detach itself and 
swing without support in a ghostly fashion in 
front of the window (for that period of time 
when the shorter edge, to which it is attached, 
is in fact nearer to the subject). Similarly, the 
rod bends, twists or “cuts through” the mul- 
lions in order to accommodate itself to the 
phenomenal oscillation. The observer finds the 
bizarre effect both amusing and inexplicable. 

The window used in the present experiment 
is smaller than the original Ames window; 
length 1314 inches, height of the long side 
1214 inches, height of the short side 9 inches. 
Ames demonstrated that within limits the 
ratio of the sides of the trapezoid to one another 
cannot affect the illusion. The optimum speed 
of rotation Ames reports as 3 to 6 r.p.m. Our 
own motor driven window ran slightly less 
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than 2 r.p.m. The original Ames window had 
mullions dividing it into 15 frames; ours had 
6 frames (probably more normal for a “win- 
dow”). For certain comparisons, we employed 
also a true rectangular window, 12” x 1074’ ; 
The explanation Ames gives for the illusion 
maintains (a) that the observer, owing to 
familiarity with rectangular windows assumes 
this window to be rectangular; and (b) that 
owing to long experience with doors, windows, 
and similar objects, the observer has learned 
to interpret longer retinal stimulations as 
coming from nearer objects. Hence, the longer 
edge of the window is interpreted as being 
nearer, and the window is seen to oscillate 
rather than to rotate. 
Ames (2, p. 14) gives a clearly empiricistic 
explanation with a leaning toward the object 
connotation version: 


In his past experience the observer, in carrying out 
his purposes, has on innumerable occasions had to 
take into account and act with respect to rectangular 
forms, e.g., going through doors, locating windows, etc. 
On almost all such occasions, except in the rare case 
when his line of sight was normal to the door or window, 
the image of the rectangular configuration formed on 
his retina was trapezoidal. He learned to interpret the 
particularly characteristic retinal images that exist 
when he looks at doors, windows, etc., as rectangular 
forms. Moreover, he learned to interpret the particular 
degree of trapezoidal distortion of his retinal images in 
terms of the positioning of the rectangular form to his 
Particular viewing point. These interpretations do not 
occur at the conscious level, rather, they are unconsci- 
ous and may be characterized as assumptions as to the 
probable significance of indications received from the 


environment. 

It should be added that Ames does not 
insist that object connotation (“windowness” 
is the sole determinant of the illusion. He 
himself employed a variety of trapezoidal 
figures and discovered that even a plane 
surface of trapezoidal shape arouses the illu- 
sion of sway, though to a much less degree 
than does a “window frame” (2, p. 29). 


The Hypothesis 

In order to test the “object connotation” 
theory, we studied various groups of Zulu 
children (10-14 years old) in Natal whose 
own culture is virtually devoid not only of 
windows, but, to a surprising extent, of angles, 
straight lines, and other experiential cues that 
would presumably “cause” the illusion if it 
were wholly a product of experience. Our 


hypothesis therefore is: 
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Zulu children, provided they are unaccul- 
turated (amabinca) will report the illusion of 
sway in the trapezoidal window less often than 
will urbanized acculturated Zulu children 


(amabunguka) or than while (“European”) 
children. 


The Zulu Culture 


Zulu culture is probably the most spherical 
or circular of all Bantu cultures, possibly the 
most spherical of all native African cultures 
(though it would be difficult to prove this 
contention). The word “zulu” means heavens 
or firmament, and the aesthetic ideal of round 
rather than angular styles affects native art, 
architecture, and speech. 

Huts are invariably round (rondavels) or 
else beehive shaped, whereas in other Bantu 
tribes they are sometimes square or rectan- 
gular. Round huts arranged in a circular form 
with round stockades to fence in animals, 
constitutes a typical African homestead 
(kraal). Fields follow the irregular contours of 
the rolling land, and never seem to be laid 
out in the neat rectangular plots so character- 
istic of western culture. See Figure 2. 

The typical Zulu hut has no windows, and 
no word for such an apperture exists. In the 
more primitive beehive grass huts, doors are 
merely round entrance holes; in the rount 
mud huts doors are amorphous, seldom 
ever neatly rectangular. Cooking pots are 
round or gourd shaped. In his studies san 
Zulus, L. Doob (6) finds that the less accu 
turated natives, relative to westernized alae 
show a statistically significant preference 
circles over squares when they are asked A 
choose between designs drawn in these shape 
(personal communication to the authors): jus 

It is commonly said in Natal that Zu 
fresh from reserves cannot plow a anne 
furrow and are unable to lay out a eet 4 
flower bed. Such inability is of course pee 
with experience and training, but the ae 
defect would seem clearly related to the 
circularity that is characteristic of life on ith 
reserves and to the lack of familiarity W! 
straight layouts. cir- 

Linguistically, the same bias towards E 
cularity is seen. While it is possible to ae 
“round” in Zulu, there is no word for oe ae 
There is a word for “circle” but not for or 0 
tangle.” To speak of window, of square, 
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rectangle at all, a Zulu is forced to borrow these 
terms from Afrikaans or from English—pro- 
vided he is able to do so. 

The Subjects 


The experiment required the use of two contrasting 
groups of subjects (Ss): those who had lived all or most 
of the their lives in western culture, and those who were 
unacculturated. Even in the Bantu reserves or in Zulu- 
land itself it is not possible to make certain that a resi- 
dent does not know what a window is like. While 
schools, churches, and health centers are few and far 
between, they are nevertheless within the possible 
Tange of visitation by most native inhabitants, even 
children. Our experiments at Polela and Ceza took 
Place in health centers, at Nongoma ina court house. 
The Ss, to be sure, were brought in from remote parts 
of the reserves by lorry, or came on foot; but they had 
at least this one-time acquaintance with a rectangular 
building and windows. 

‘Still, it is possible to say that the experiment dealt 
with two widely contrasting groups in respect to the 
degree of experience they had had with western archi- 
tecture and ways of life. Some members of the more 
primitive groups, for example, may never have seen 
windows with rectangular panes of glass prior to the 
actual experimental situation. 

By using herd boys as Ss—mostly between 10 and 
14 years of age (few of them knew their age exactly)— 
We were able to make certain that they had never been 
off the reserves and had never attended school. Boys of 
the same age comprised our urban control groups: one 
group of European boys at Greyville Community 
Center; another group of Bantu boys at the Lamont- 
ville Community Center in Durban. Most of these 


urban boys were attending school. 1 
Our major experiment thus involved the following 


groups: 
Group A Urban European boys (20 cases) 
Group B Urban African boys (20 cases) 
Group C Polela Rural Africans (20 cases) 


Group D Nongoma Rural Africans (20 cases) 

A rough indication of the cultural differences between 
the rural and urban groups lies in answers to the ques- 
tion asked at the end of the experiment about the 
rectangular window, “What does this look like?” The 
percentage saying “window” or “window frame” 
among the urban children was 88; among the rural, 45. 


Procedure 


The procedure involves four conditions, varying 


two factors bearing on the perception of the illusion: 
monocular vs. binocular viewing and distance from the 
stimulus object.: Each S $7 first the rectangular, and 
then the trapezoidal w’ in at least 3 full revolu- 
tions under each of t?“ s0llowing conditions. 


First trial: 10 ft. binocular 
Second trial: 10 ft. monocular 
binocular 


Third trial: 20 ft. 

Fourth trial: 20 ft. monocular í 
It was thought that this order would impose the 
“hardest” condition first and therefore minimize the 


effects of suggestion. One might fear that if at 20 feet 
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with one eye a subject easily perceived the illusion he 
might become accustomed to expecting oscillation in 
the trapezoidal window at closer distances and under 
binocular conditions. Conversely, of course, it might be 
argued that a subject who cannot perceive the illusion 
at ten feet binocularly would form an expectation that 
might prevent his obtaining it under easier conditions. 
We shall refer later to a control experiment (starting at 
20 feet monocularly) designed to check on any suggestive 
effect that might arise from our order of presentation, 

The experimenter (E) required the assistance of a 
second psychologist who kept records of the Ss’ reports, 
also of an interpreter with all African Ss. Care was 
taken to prevent Ss who had finished the experiment 
from communicating with Ss who had not. 

After being put at ease, the S gave his age (if he knew 
it) and his degree of education (if any). The S then sat 
in a chair placed at the proper distance from the window 
and was told to watch carefully the movement that he 
would see. After approximately three revolutions the Æ 
asked, “How does it seem to you to be moving?” Often 
the S spontaneously used his hands to indicate the 
motion until the Æ was satisfied whether a full rotation 
or a fluctuation was intended. The use of hand motion 
by the S proved to be fully convincing, for when he 
reversed the hand at precisely the right moment for the 
illusion to occur there could be no question concerning 
his experience. This device gave a useful check on the 
accuracy of the translator’s report of the S’s verbal 
statements. 

After obtaining a report for the rectangular window 
in each of the positions, the trapezoidal window was 
inserted in place of the rectangular, and the same 
method of report employed. In addition, the S was 
asked to tell whether the motion of the trapezoidal 
window was “like” that of the first window. This pro- 
cedure served as a further check on the verbal descrip- 
tion and hand report. In nearly all cases it was possible 
to record a clear and unequivocal judgment of the S’s 
perception. Less than three per cent of all judgments 
were listed by the E as “uncertain.” 

Whenever the illusion was reported for the first con- 
dition, the bar was inserted and the S asked, “How 
does the bar move?” and “Does the bar stay straight?” 
On occasional trials when the S had reported both the 
sway of the window and the bending of the bar, the cube 
was attached and the S asked to describe its motion. In 
these cases there was usually laughter (as with Ameri- 
can Ss) and considerable confusion and difficulty 
manifested in describing so unreal and “spooky” a 
motion. Because of the difficulty of communicating con- 
cerning these complex phenomena we make no further 
systematic use of the cube and rod in the present study. 

At the conclusion of the experiment, the S was asked 
what the rectangular window “looked like.” He also 
stated his preference for one of two geometrical draw- 
ings presented to him in pairs (a circle, square, trape- 
zoid). He then received a slight payment for his services 
(usually one pound of sugar or a candy bar and six- 


pence). 
RESULTS 


General resulis. Table 1 gives the results for 
the two unacculturated groups (Nongoma and 
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TABLE 1 
NUMBER REPORTING ILLUSION 
(Boys 10-14 Yrs. of Age, N = 20 in Each Group) E 
Nongoma Rural Polela Rural African Urban European Urban 
Condition Yes No Upi" Yes No Uncer yes No Uncer- yes No Uncer 
i dition (10’, both eyes) 3 17 Q 4 14 2 13 7 0 11 9 0 
Se o (10’, one eye) 14 6 0 16 4 0 19 1 0 19 1 0 
Third condition (20’, both eyes) 8 12 0 17 1 2 16 3 1 16 4 0 
Fourth condition (20’, one eye) 18 2 © 17 2 1 18 2 0 19 o 1 
Totals 43 37 0 54 215 6 13 1 6 14 1 
Polela Reserves) and for the two districts TABLE 2 
within metropolitan Durban, African (Lamont- DISTRIBUTION OF SCORES č  ěž 
ville), and European (Greyville). A NUADH Yeh 
Combining all four conditions, there is a Sample i Avani 
very significant tendency for the urban groups 4]3] 2|1jo 
to report the illusion more often than the rural = =! 4 a a] TT 
groups (corrected 2 X 2 X? = 15.34; p < 001). Pole flao) 31i S14 A 
This difference is most marked with the first Urban African 32i 4] 2] 2/0] 3:30 
condition (corrected 2 X 2 X? = 12.38; p < Urban European | 11 | 5} 3| Of} 1) 3.25 
-001). There is also a significant trend with the Total (V = 80) | 29 23 | 18 [7 [3 | 2.85 


second condition (corrected 2 X 2 X? = 4.80; 
$ < .05) and a slight tendency with the third 
condition (corrected 2 X 2 X? = 1.87; p < 
.20) for the rural children to observe the illu- 
sion less often than the urban children. 
Virtually no difference exists with the fourth, 
20 feet and one eye condition. 

Table 2 expresses the results in an alternative 
way. Since four conditions of presentation 
were used we can determine in how many of 
these four conditions on the average each of 
the cultural groups reported the illusion. For 
the two unacculturated groups combined, 
the illusion is reported in 2.425 of the four 
conditions, while for the acculturated groups 
the average is 3.275. This mean difference has 
high statistical significance (¢ = 3.51; p < 
.001). 

It is evident from Tables 1 and 2 that city 
dwellers, whether Zulu or European, find the 
illusion somewhat more compelling than do 
rural (“primitive”) natives. This tendency is 
especially pronounced at 10 feet with binocular 
vision—a condition when binocular cues of 
true depth (in this case, true rotation) are 
most plentiful. The reader will also note that 
the results for Polela (rural) stand somewhat 

between those for the city children and those 
from Nongoma (rural). At ten feet binocularly, 
they resemble those of Nongoma; at twenty 
feet binocularly, those of the city boys. Polela 


is, in fact, one hundred miles closer to Durban 
than is Nongoma which lies in the heart ° 
Zululand. There is no doubt that the children 
in Polela have somewhat more familiarity 
with western architecture (specifically with 
windows) than do the children of Nongomé- 
The results (Table 2) correspond to a 
continuum of cultures: city children having 2 
maximum of familiarity with western architec- 
ture, Nongoma children the least. 


Preference for circles. Following the exper 
ment, all Ss were shown drawings of a squares 3 
trapezoid, and a circle (in pairs), and asked t 
express a preference. Table 3 indicates 
those who expressed a preference for the Cree 
(at least once in the two pairings) tend in en 
African groups to report the illusion /ess oft di- 
This tendency holds for all experimental cof, 
tions for all three African groups- ets 
relationship is statistically significant, how® ee 
for only the binocular conditions. cant 
preferring Zulu children report the 1 oe 
significantly less often than the angle P 
ferring Zulus in Conditions 1 and 3 (onen 
2 X 2 X? = 3.89; p < .05), but the differe j- 
in the monocular, second and fourth, cel 
tions is not significant (corrected 2 X 2 ap’ 
0.18, m.s.). There are no differences 


Vay 


Tue TRAPEZOIDAL ILLUSION AMONG ZULUS 


TABLE 3 
PERCENTAGE OF CASES REPORTING ILLUSION AMONG 
SUBJECTS PREFERRING AND NOT PREFERRING 


CIRCLE 
Combined African European Group 
Groups 
Noo WS" 
Condition = 
_ Not Pre- p A Not Pre- 
“eae "ae Ee 
aa - = F= 
10 binocular 28 43 58 50 
10 monocular 79 86 100 88 
20" binocular 59 86 83 75 
20’ monocular 87 90 100 88 
All conditions 63 76 85 75 


proaching significance between the circle and 
noncircle-preferring European Ss. 

Let us assume that the aesthetic preference 
for circles may provide an index of the subjec- 
tive closeness of the individual to Zulu culture 
(since it is, as we have scen, overwhelmingly 
a circular culture). If we do so we may say 
that this subjective closeness seems to pre- 
dispose the S to resist the illusion. Stated in 
terms of transactional theory, rectangles and 
trapezoids have less functional significance for 
him. His preception of the window’s rotation 
Is accordingly more frequently veridical. 

We have noted that this influence is si 
cant only in the conditions involving binocular 
perception. A reasonable interpretation would 
be that cultural effects cannot easily change 
the basic demand character of the illusion 
monocularly perceived, but may do so when 
binocular conditions leave more latitude for 
choice and for interpretation among a greater 
number of cues. 

This result then, so far as it goes, lends 
some weight to the contention that “cultural 
significance” is playing an appreciable part in 
determining the results. 

Illusion with rectangle. Before viewing the 
trapezoidal window, every Ss in all four condi- 
tions first saw the rectangular window rotating. 
The purpose was to make sure that the sway 
(oscillation) reported for the trapezoid was 
judged to be different from the motion of the 
rectangular window. In most cases, indeed, the 
S was able to make the distinction clearly, 
indicating by gesture and by words that the 
rectangular window went “round and round” 
whereas the trapezoid oscillated. 

There were cases, however, W 


ignifi- 


here the rec- 
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tangular window was reported as oscillating. 
In fact, nearly one-third of the 80 Ss reported 
such a phenomenon at one or more of the 
four conditions. The actual percentage re- 
porting sway in the rectangular window at 
each of the four conditions is: 


First condition 0 
Second condition 8 
Third condition 16 
Fourth condition 28 


It is conceivable that this curious and some- 
what unwelcomed finding may be a result of 
a “suggestive” order of presentation. Thus, 
no S seeing the rectangle before the trapezoid 
under the first condition (10 feet binocularly) 
reported the phenomenon. And, with the 
exception of 3 cases, no S reported the 
rectangular illusion in the second, third, or 
fourth condition unless he had previously 
reported the trapezoidal illusion. Altogether, 
81 per cent of our Ss reported the illusion 
monocularly at 20 feet for the trapezoidal 
window, but only 28 per cent did so for the 
rectangular window under the same condition. 
In virtually all these cases the Ss had grown 
accustomed to seeing oscillation at some 

revious stage with the trapezoid. 

Pastore (18), however, finds that more than 
half of his 58 American college Ss reported 
h the rectangle during a three-minute 
and at considerable distance from 
the window (where the retinal angle subtended 
by the two shapes may be subliminal). He 
does not tell whether the Ss had grown accus- 
tomed to the sway of the trapezoid before 
they reported sway in the rectangle. We must 
leave this problem for the time being unsolved. 


A CONTROL EXPERIMENT 

In order to determine whether unwanted 
suggestive effects, caused by our order of 
presentation, were influencing the results at 
the optimal stage for the trapezoidal illusion 
(viz, 20’ monocularly) we simplified our 
procedure with entirely new Ss. Urban and 
rural Africans served as before. To secure the 
latter, we visited the Ceza Medical Mission 
in Zululand, approximately 20 miles from 
Nongoma. Both Ceza and Nongoma are in 
the deepest part of the native reserves, Over 
200 miles north of Durban. Polela, as we have 
said, lies about 100 miles west of Durban and 
has more European influence (e.g., European- 


sway wit 
exposure, 
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style architecture). This fact, we repeat, seems 
to explain why, as Tables 1 and 2 show, the 
Polela Ss report the illusion somewhat more 
frequently than do Ss at Nongoma or Ceza. 
In the control experiment, the S was asked 
to cover one eye. Sitting at twenty feet from 
the rotating trapezoidal window he then 
described its motion (both in words and by 
hand motion). Later he was seated at ten feet 
from the object and using both eyes described 
the motion, comparing it with the previous 
motion. Finally he was, as in the other groups, 
asked his preference for the circular, square, 
or trapezoidal figures. None of these herd 
boys had ever been to school. 
For an urban control group we used a fresh 
population of Lamontville boys of the same 
age range. 


Results 


Table 4 gives the results, 

These data are practically identical with 
those of the Nongoma and urban African 
samples cited previously. Again, the rural 
Zulu group reported the illusion significantly 
less often than the urban Zulu group in 10 
binocular condition (corrected 2 X 2 ie = 
11.53; p < 001), but no differences appear 
at the 20’ monocular condition. This similarity 
of data proves that the order of presentation 
is not an important variable. 

At the Ceza Mission Hospital we tested also 
a group of eleven expectant mothers, only 
one of whom had ever left the reserve. Eight 
reported the illusion at 20/ monocularly, two 
did not, and one was uncertain. None of the 
eleven, however, reported it at 10’ binocularly, 
These cases confirm the trend in all our tables 
that “primitives” are less able to perceive the 
sway in the trapezoidal window illusion under 


TABLE 4 


NUMBER REPORTING ILLUSION IN CONTROL 
EXPERIMENT 


Ceza (Rural) Lamontyille (Urban) 
N= N=21 


Condition Diea 


Yes No tain Yes No Uncer- 


First condition 22 2 0 20 o 4 
(20, monoc- 
Jar) A 
Sead condi- 2 18 
tion (10, bi- 
nocular) 


4 14 7 0 
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TABLE 5 


PERCENTAGE OF CASES REPORTING ILLUSION AMONG 
SUBJECTS PREFERRING Two CIRCLES 
Combined African Groups (Ceza and Lamontville); 

= 44 


| Preferring Two | Preferring Less 
Condition Circles 0| "than Two 

| vais N=39 
20’ monocular 92 97 
10’ binocular 27 38 


marginal conditions (i.e., at 10’ binocularly) 
than are city dwellers. 

Something should be said concerning the 
qualitative differences reported by Ss who 
first reported the illusion at 20’ monocularly 
and then again at 10 binocularly. Not infre- 
quently their reports at 10’ binocularly were 
“mixed,” that is to say, sometimes they saw 
the oscillation and sometimes not. In every 
case, the S was asked to tell the “difference 
if any existed between the movement seen at 
20’ monocularly and that at 10’ binocularly- 
Most Ss claimed that there was a difference: 
Sometimes the window at the closer distance 
seemed to move faster, sometimes in a bigget 
arc, sometimes even in the reverse direction, 
And often, as we have said, the reports at al 
binocularly were “mixed—the subjects pe 
porting sometimes a full rotation and some 
times a sway. We record “yes” to the illusio® 
at 10’ binocularly if at any point in the ene 
ment the S reports a clear oscillation. Sinc 
the same criteria were applied at both Ceza 
and Lamontville, no source of error is intro 
duced. ; 

If the reader is acquainted with the lysine 
he will no doubt recognize this ambiguity } pi 
the perception at 10° when binocular cues ae 
Powerful evidence for true rotation, and a 
the tendency to see illusory sway likewis? 
exists. Because of this dual tendency we cae 
sider 10’ binocularly as a marginal tondin 
for the illusion. What is important for O 
Purposes is the finding that under en 
marginal conditions urban children, who he 
familiar with western architecture, report a 
illusion much more frequently than do be 
boys on the Zulu reserves. sed 

As in the major experiment our Ss expres = 
their preference, in three paired compariso 1 y 
for a circle, trapezoid, or square. Since One 
Seven cases of the 45 failed to choose an 
circle at least once, we changed our criterio? 


— => m aac 


THE TRAPEZOIDAL ILLUSION AMONG ZULUS 


TABLE 6 


PERCENTAGE OF Supyects RECOGNIZING STIMULUS AS 
A “Winpow” Wuo REPORT THE ILLUSION 


Recognizing |Not Recognizing 


Condition “Window” “Window” 
N = 20 N= 
20' monocular 100 88 
10’ binocular 55 29 


slightly from that used in Table 3. We deter- 
Mined the occurrence of the illusion among 
those who chose the circle wice as compared 
with those who chose it only once or not at 
a5 Table 5 shows that in this population 

ewise, Ss who show a preference for circles 
tend—particularly in the binocular condi- 
tion—to report the illusion somewhat less 
frequently than those who do not. The differ- 
ences are not statistically significant but are 
in the same direction as those reported in 
Table 3. The implication of this finding, we 
repeat, seems to be that Ss whose aesthetic 
preference lies with the circularity of their 
tribal culture are the more resistant to the 
illusion. 

At the conclusion of the experiment the 


investigator showed each S the rectangular 


window and asked, “What is this?” (In order 
ildren would have 


to make certain that the chi 
an opportunity to say “window” if they per- 
ceived the resemblance, the question was 
asked in three different ways in the Zulu 
language.) To one or morè of these three 
questionings 67 per cent of the urban children 
said “window,” but only 26 per cent of the 
Ceza children gave the same reply. If we 
combine all cases who said “window” at both 
Ceza and Lamontville we find an appreciable, 
though not statistically significant, tendency 
for them to report the illusion more often than 
do children who did not recognize the window- 
ness of the stimulus object (corrected 2 X 
2 2 = 2.81; p < -10 when both conditions 
are combined). So far as it goes, this finding 
(Table 6) lends support to the object-connota- 
tion theory of the perception of movement, 
especially under the 10’ binocular condition. 


DISCUSSION 


Our most striking finding is that under 
Optimal conditions (monocularly at 20 feet) 
Virtually as many primitive Zulus report the 
trapezoidal illusion as do urban Zulus or 
Europeans. Taking this one partial result by 
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itself we can say that the experiment supports 
either the nativistic or the cumulative habit 
theory. It does not by itself give us grounds 
for choosing between them. 

Nativists might argue, for example, that 
whenever a longer and a shorter projection 
on the retina occur simultaneously the longer 
will assume a figure character and therewith 
a frontal position in the perception (other 
conditions being equal). Thus, some form of 
isomorphism obtains between retinal-cortical 
processes and the perception itself. 

An empiricist with a “cumulative habit” 
preference might say that myriad ocular-motor 
adjustments from infancy have built up @ 
dependable expectancy that longer projections 
on the retina will betoken nearer objects. One 
learns through repeated experience that longer 
retinal images of trees, cattle, people, stand 
for nearer objects (provided, of course, that 
one assumes such objects to be of equal size 
whether far or near from the eyes). It is not 
necessary for the S to have acquaintance 
with specific objects (in this case a window) in 
order to make a similar inference. The transfer 
effect is wide. Even the shadows painted on 
the rotating window are reminiscent of the 


S’s experience with shadows in nature. Old 
s automatically condition novel ex- 


experience: 
even though the latter are only 


periences 


analogous. 
One assumption that may play a decisive 


part in this case is the assumption of “right 
angularity.” From earliest life the child is 
conditioned to the fact that perpendicular 
objects best withstand the force of gravity. 
Circular though his culture is, his basic frame 
of reference is still one of verticals and hori- 
zontals. Seeing an entirely new object (the 
trapezoidal window) he assumes unconsciously 
(no less than do people who are familiar with 
windows) that its shape is rectangular. Just 
like people in western culture he may make 
this assumption even if he “knows” that the 
object is not in reality rectangular. This 
assumption, together with the assumption 
that longer objects on the retina are usually 
nearer objects, would predispose him to per- 
ceive that the longer edge of the window is 
always nearer (thus inducing the perceived 
oscillation). No less than people in western 
culture he would fail to “correct” his assump- 
tions of right-angularity and of long-edges- 
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being-near-edges by his “knowledge” of the 
trapezoidal shape of the stimulus. . 
Our major result is clearly not compatible 
with a narrowly conceived object-connotation 
theory. It is not necessary for the S consciously 
to assume that the object is a window in order 
to experience the illusion. True, as Table 6 
suggests, the specific object connotation seems 
somewhat to favor the illusion, but it is clearly 
not the decisive determinant. Thus, for 
example, 88 per cent of those who did not 
consciously recognize the frame as a “window” 
nevertheless experienced the illusion at 20 
monocularly. 

Yet, at the same time, our results show that 
object connotation cannot be disregarded. It 
also plays a part. Let us review the evidence: 

1. Under all suboptimal conditions, as we 
seein Tables 1 and 4 (10 feet monocularly, and 
binocularly at 10 or 20 feet) there is a tendency 
for unacculturated Ss to report the illusion 

less frequently than do the acculturated. 

2. The Ss who recognize the “windowness” 
of the stimulus object tend to report the illu- 
sion somewhat more frequently especially at 
10 feet binocularly (Table 6). 

3. African Ss expressing a preference for 
circles (assumed here to indicate a subjective 
closeness to the rotund Zulu culture) tend to 
report the illusion less often than those ex- 


pressing preference for angular figures (Tables 
3 and 5). 


We conclude that experience 
identification with, western cullure m 
likely that the illusion will be perc 
marginal (suboptimal) condilions. 

One fact reported by Ames, and mentioned 
above, supports our interpretation. He finds 
that a plane trapezoidal frame yields appre- 
ciable oscillation, but that the addition of 
mullions, panes, and shadows enhances the 
illusion. In other words, specific “thingness” 
contributes to the experience though it does 
not account for it wholly (6, pp. 28-31). And 
we may again allude to von Schiller’s conten- 
tion that expectancy is effective in determining 

perceived movement under marginal (, 
nativ) conditions. 

May Brenner (3) likewise makes the point 
that when marginal conditions obtain, the S 
is forced to depend on stimulus meaning. On 
the other hand, when optimal stimulus con- 
ditions obtain, even brain-damaged cases re- 
port apparent movement to much the same 
degree as do normal cases. 


with, and 
ake it more 
eived under 


alter- 
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Several other experiments have dealt with 
the effects of meaning on apparent movement. 
Thus, Jones and Bruner (10) report that in a 
stroboscopic experiment the line drawing of a 
man is seen to be in motion more actively than 
is a nonsense figure. De Silva (5) had previously 
established this same fact. Jones and Bruner 
conclude that “the more probable and prac- 
ticed the movement, the more adequately will 
the movement experience maintain itself under 
suboptimal conditions?” (10, p. 165). This 
conclusion is in agreement with our own. 

Toch and Ittelson (22) report an experiment 
in which drawings of a bomb stroboscopically 
presented are seen in a downward (falling) mo- 
tion, whereas drawings of an airplane presented 
in an identical fashion are seen in an upward 
(rising) motion. Though this experiment taken 
by itself favors an object-connotation theory, 
the authors argue in general for the cumulative- 
habit theory. They contend, rightly no doubt, 
that the nativist position cannot be adequately 
tested short of a longitudinal study of infants 
from birth. They believe that generalized past 
experience accounts for our major dispositions 
to perceive stroboscopic or other illusory move 
ment, but allow that under conditions of am- 
biguity or equivocation specific meaning 
connotations will enter to determine the direc 
tion and nature of the movement. Here, t00, 
our findings are concordant. 5 

If we leave the field of experimental testing 
for a moment, we can find many familiar 10- 
stances of the role of object connotation K 
resolving perceptual ambiguities. A streak © 
light in the night sky may be seen as a shooting 
star, as distant fireworks, or as a jet pani 
depending largely on one’s expectations. Bar 
lett tells of the Swazi chieftain who parar 
all traffic policemen in London as friendly, 
ings, because in Swazi culture the Lag 
arm is an amiable greeting. A child in a de A 
tist’s chair, more familiar with spaces 
than with nitrous oxide, perceives the inhala i 
as a spaceship toy. Every projective test a 
Sumes that ambiguous (multivalent) en 
will receive subjective structuring on the ba 
of need, set, expectancy, or habit. of 

Our experiment does not introduce factors c 
need or of set, but deals only with the pan 
of past experience (meaning) as a determin 
of perceived movement. It may, BORET a 
pointed out that among sophisticated OP eg 2 
of the trapezoidal illusion under marginal jun 
ditions (e.g., at 10 feet binocularly) a V° 
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tary effort to see or not to see the window as 
oscillating (or as rotating) can also be effective, 
especially if the observer picks out some detail 
of the window to watch during the rotation, 
thus inhibiting the impression as a whole. 
Meaning is not the only determinant entering 
into the resolution of perceptual ambiguity, 
but it is one of them. 

Returning to James’s statement that, “Per- 
ception is of definite and probable things,” we 
may say that under optimal conditions of 
stimulation definite structure is conferred by 
physiological conditions or by deeply ingrained 
functional habits of spatial adjustment, or by 
both. But when marginal conditions prevail, an 
association with the most “probable” object is 
often called upon to provide the definiteness 
that is otherwise lacking. 

What we have called “marginal” conditions 
should receive a further word of explanation. 
We use the term in our experiments to indicate 
that perceptual conflict is present. Under bi- 
nocular conditions (especially at 10’) there are 
many cues that “give away” the true rotation; 
at the same time there are operating also the 
assumptions that the window is rectangular 
and that longer objects are nearer. Under such 
a condition of conflict our finding is that urban 
children resolve the conflict with the aid of the 
supplementary assumption of “windowness.” 
Not being able to draw on this supplementary 
assumption, the rural children as a rule resolve 
the conflict in favor of the binocular (or true) 
evidence. In this particular case, therefore, one 
might say that the primitive children see things 
“as they are” more often than do the children 


of civilization. 


CONCLUSION 
motion as represented in 
dal window is governed, 


under optimal conditions, by nativistic deter- 
minants or by the unconscious utilization of re- 
sidual (but not immediately relevant) experi- 
ence, or by both. (Our experiment does not 
enable us to decide this issue.) At the same 
time, object connotation (meaning) based on 
closely relevant cultural experience helps to 
determine the nature of the perceived move- 
ment under marginal conditions. 

An adequate theory of perceived movement 
must therefore allow a place for the subject’s 
Specific assumptions of meaning even though it 
cannot be based solely on this foundation. 


The perception of | 
the rotating trapezo! 
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A MOTIVATIONAL FACTOR IN CONFORMITY! 


GEORGE MOELLER anb MORTIMER H. APPLEZWEIG 
Connecticut College 


ONFORMITY in group situations may 
arise from many sources. This is a 
report of research into the effects on 
conformity behavior of individual motivation 
ial approval. 
— ee indicated elsewhere (2, 3, 4), any 
of a number of behavior modes may serve as 
means toward satisfaction of a particular 
motive, and, conversely, a particular mode 
may serve any of a number of motives. This 
point of view implies that prediction of an 
individual’s mode of behavior in a given situa- 
tion must be based on knowledge of the situa- 
tion and of the individual’s motivation, Con- 
forming behavior, as an example of one such 
mode, might then be predicted from a knowl- 
edge of individual motivation patterns, pro- 
vided the performance situation permits 
gratification of the relevant motives in this 
way. 

The present study was designed as a test of 
this general hypothesis. It was predicted that 
college students with high motivation for 
social approval would be more likely to agree 
with the judgments of their peers in the type of 
conformity situation developed by Asch (5, 6) 
than would students who were not so moti- 
vated. This hypothesis in no way precludes the 
Possibility that conformity behavior may 
result from other patterns of motivation in 
other situations, but, rather, states a specific 
relationship between the Strength of one 
motive and the occurrence of one mode of 
behavior in a particular situation. 


METHOD 
The Measure of Motivation 


A new forced-choice motivation questionnaire, The 
Behavior Interpretation Inventory (9), was administered 
as part of the regular freshman orientation testing pro- 
gram at Connecticut College in the Fall of 1954. The 
inventory consists of 59 items, the stem of each repre- 


1 This article constitutes Technical Report No. 5, 
Project NR 172-228, Contract Nonr 996(02) between 
the Group Psychology Branch, Office of Naval Re- 
search and Connecticut College. Permission is granted 
for reproduction in whole or in part for any purpose of 
the United States Government. A report of this research 
was given at the meetings of the Eastern Psychological 
Association, New York City, April, 1957. 
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senting a small sample of behavior, with four “reasons” 
given as alternatives. The four alternatives for each 
item are specific to the item, but are always some form 
of (a) escape from present unpleasantness, (b) concern 
for a secure future, (c) attainment of social approval, 
and (d) consistency with a self-picture. These four 
general motive categories were selected after a logical 
analysis of systems of classification of motives by 
Maslow (8), Murray (10), and Stein (15). A distinction 
was drawn between specific behaviors (modes) and 
generalized goal orientations (motives). Behaviors often 
grouped under the category of achievement motivation, 
for example, were considered to represent more appro- 
priately behavior modes than underlying motives. A 
particular mode (e.g., acquisition, aggression, affilia- 
tion, etc.) can serve any one or more of an individual’s 
motives, depending upon the situation in which it is 
called forth. 

The utility of these particular motive categories was 
checked empirically in a study using an open-ended 
(sentence completion) form of The Behavior Inter prela- 
tion Inventory (1). The four categories were found 
sufficiently inclusive to permit scoring of all response 
themas in the protocols obtained, Independent evidence 
for the usefulness of this motive classification system 
can be inferred from a recent study of men under mili- 
tary combat stress (11) where a list closely appro e 
ing our categories was obtained. The four choices ! 
wise resemble the four “wishes” of Thomas (17). four 

Subjects (Ss) were asked to rank each set of fo a 
alternatives in the order of likelihood that they wer 
the reasons for the behavior described in the item ner 
Since the item stems represent a variety of ees 
and behavior modalities, the frequency of preference t0 
a given motive was taken as an index of the streng f 
of the motive relative to that of the other motives f0 
a particular individual. Score on a given scale, then, 
was the total number of first choices assigned to alter 
natives appropriate to that scale. A total of 25 behavior 
modes was used in the 59 items of the inventory» 
covering a wide range of situations and types of ane oa 
The two examples below represent modes of Pout ali 
action (#35) and physical sentience (#52). In brac! a 
following each alternative is the number represent 
the motive category (see above). y but, 

35. He has been in poor health for some time, As 

despite this, he is engaging in strenuous physi 

activity because: af 
(A) He fears his health might get even wors 
he should take it easier [2]. d lets 
(B) This relieves some of his tensions an! 
him sleep better [1]. 


D 
? Alternative scoring for the sum of ranks has 

used and is described elsewhere, as is the inventory ip- 
its rationale in greater detail (2, 3, 4, 10, 16). g tly 
ventory was originally developed by the autio ni ga 
with Drs. Dee G. Applezweig and Ross Re n which 
Dr. Harvey Burdick participated in the revisions 
resulted in the present form. 
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i TABLE 1 
EAN BII Z-SCORE PROFILES FOR SOCIAL-APPROVAL, 
SELF-APPROVAL AND SOCIAL- AND 
Serr-ApprovaL GROUPS 


BII Scale: 
Group High i F 7 A 
Y [Esame SA | Ane | Appr 
a on 
Social Approval 15|—0.51| 0.07| 1.27|—0.51 
pproval 13|—0.51|—0.73|—0.46| 1.10 


Social & Self Approvall13|—0.78|—0.56| 0.37| 0.67 
Social & Self Approvalj13|-0-78/—0-56; 0.37] 0-67 


(C) He has always believed in keeping one’s 
body strong [4]. 
(D) He hopes that others will admire him for 
trying [3]. 
52. This man is lolling in the warmth of the water 
at a hot-springs resort. He often comes here because: 
(A) He finds his mind is clear and he can think 
best when he is relaxed [4]. 
i (B) He feels it is socially fashionable to vaca- 
tion here [3], 
reife), The sulphur of the water aP 
( po rheumatism (1). 
healthy a Rena that this will help 


Subj ects 


d the warmth 


him stay 


BI 

y e of 263 Ss in the class were divided into 
wi se ii on the basis of scores on the scales of social 
Viduals y, Pproval (scales 3 and 4, respectively). Indi- 
their Sco cre classified as high and low on each scale as 
Kale, oe fell above or below the median on that 
Wed as a, z on the two irrelevant scales (1 and 2) were 
Broups wit urther restriction on sampling to produce 
sales, A t approximately equal means on these two 
0 Dg o ee Sample of 88 Ss was finally obtained, con- 
Ie Self ap S high on social approval (scale 3) and low 
OW on oval (scale 4), Ss high on self approval an 
= 88 a approval, and Ss high on both scales. of 
& inder aa ìn the experiment, 41 were used, the re- 
het in p28 discarded for the following reasons: 3 for 

Cp; € conduct of the experiment, 6 for error on 


t 
ee true 5 a member of the majority, 10 who guesse 
it, Urpose of the experiment while participating 
interview to 


a AN 
nave fee who appeared on subsequent 
Utpose |p Overly suspicious of the proce 
abl e experiment.’ 
ree ee Presents mean BII z-score profiles for m 
È Roteg Pisa whose data compose this study. It shou 
I hat the group high on both scales is actually 
tery; 
pith 3 paws were tape-recorded and reviewed by the 
i hich the thout knowledge of the motivation-grouP to 
8noreg i belonged. Conformity score was likewise 
i se ug otezmining whether the data for a particu- 
2 SCoreg be discarded or not. Actually, the conform- 
“Spicio of the 38 Ss discarded (for knowledge °F 
siderahi o of the purpose of the experiment) varied con- 
Soup me although the mean number of yields for this 
S smaller than for those Ss used. 


dure and/or 
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not as high as either of the other grou 
4 ps on the relevant 
scale. This matter will be discussed further below. 


Procedure 


The experimental procedure followed that employed 
by Asch in his studies of the effect of social pressure on 
the judgment of length of lines (5, 6), with the exception 
that the size of the majority used for the present 
study ranged from 4 to 6, in most cases being 5, while 
Asch used 6 to 8 member groups.* 

Each S was asked to participate in a study of per- 
ception. On reporting, S found 4 to 6 of her classmates 
(who had ostensibly received identical requests) waiting 
for the experimenter (E). The group was brought to 
the experimental room and seated in a semicircle facing 
the blackboard, Æ then read the following instructions: 

This is a task involving the discrimination of 
lengths of lines. Before you is a pair of cards. On the 
left is a card with one line; the card on the right has 
three lines differing in length; they are lettered A, 
B, and C, inorder. One of the three lines at the right 
is equal to the standard line at the left—you will 
decide in each case which is the equal line. You will 
state your judgment in terms of the letter of the 
line, There will be 18 such comparisons in all. 

As the number of comparisons is few and the 
group small, I will call upon each of you in turn to 
fn ateen, i taaa tea mear 
on a piep and ie Pealen during the experi- 

le. Do no ij 
ol Any questions? Suppose we start at the i 
d to the 


P 
hen conducted in accordance 


The experiment was tl 
with the instructions. e 4 to 6 classmates of S oa 
been instructed beforehand to give specific erroneou! 
onses on 12 of the 18 trials. To reduce the possi- 
bilit of error b the first member was 
ee a list of the correct responses vat oe 
i A z 
eens ie erer d the leader’s response. The 
ers O 
mA S pare eee to overact 
j vere truly yera 
poen To all cases the members of the be et 
oe Garr seats so that all but one member was to > S 
xe) 
right, the other to her left. 
At the conclusion 0%. 
members of the group, 12 turn, 


rooms for @ brie! 


e experiment, E asked the 
report to separate 
of course, the only 
iewed. This inter- 
d to inform ture of the ex- 

some of the anxiety which 


jew serve A 
ve d thus relieve 


articularly W “nent at face value, The au- 
s aren er as their schedules 


thors alterna! 
permitted. 


rved as experimenter in 
ocedures. We wish to 
the many students 
d subjects. 


4 Mrs. Joan Lake Kaiser 5e 

the conformity measurement os 
ess our thanks to her and “iss 

Ey as experimental assistan 


who served 
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RESULTS 
Quantitative 


Figure 1 shows the percentage of Ss in each 
group which yielded to the majority on the 
twelve trials in which the majority responded 
incorrectly. It can be seen that the social- 
approval group consistently yielded more fre- 
quently than did either of the other two groups. 
The performance curves of the self-approval 
group and the group high on both scales could 
not be distinguished from each other. Further, 
from the third critical trial forward, the per- 
formance of the three groups seemed to reach 
a stable level, such differences as appeared 
neither increasing nor decreasing consistently. 
Table 2 shows the number of Ss who yielded 
to, or compromised with, the majority a given 
number of times for each of the three groups. 
It can be seen that the frequency distribution 
for the social-approval group is somewhat 
skewed toward high frequency of yielding; 
while the distributions for the other two 
groups are skewed in the opposite direction. 
The Kruskal-Wallis nonparametric measure 
(H) for comparing sum of ranks (14) yielded a 


Soc. App.————w=15 
100) Self App—ne13 


Soc. & Self App.------- N=13 


Number of Subjects (in percent) 


9 10 n 32 
Critical Trials 


Fic. 1. PERCENTAGE OF CONFORMING AND COMPROMISE 
RESPONSES FOR EAcH PROFILE GROUP on THE 
TWELVE CRITICAL TRIALS 
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TABLE 2 
FREQUENCY OF COMPROMISE OR YIELDING IN EACH 
OF THE EXPERIMENTAL Groups ON THE 
TWELVE CRITICAL TRIALS 


Experimental Groups 


Freque 
Frequency Soc. Appr. Soy eel Self Appr. 
(Scale 3) (Scales 3 & 4) (Scale 4) 
N=15 N = 13 N = 13 
12 1 — 1 
11 1 2 aagi 
10 1 1 = 
9 2 — = 
8 3 = 2 
7 1 — 1 
6 2 1 1 
5 1 — 2 
4 — 2 pan 
3 1 1 = 
2 1 — z 
1 1 1 = 
0 = 5 4 


$ close to .05 for these data, indicating that 
the groups do in fact differ in the predicted 
direction. Comparison of the several pairs of 
treatments by U tests further showed that the 
social-approval group differed from the other 
two, $ being less than .05 in both cases (single- 


tailed test). The other two groups do not differ 
from each other. 


Qualitative 


The _ postexperimental interview yielded 
qualitative data that generally support the 
findings reported for the experiment proper 
The standard questions that elicited different 
Tesponses from the several groups, or that are 
of interest in themselves, are shown in Table 3; 
along with the frequency of the various 1°- 
sponses. Data for the two Ss who always 
agreed with the majority are not included 
Since most of the questions could not be aske 
of them, 

Only Questions 1 and Tc evoked answer 
requiring any kind of clinical evaluation 1” 
Scoring. These answers were typed verbatim, 
then analyzed without reference to the Se 
group membership. The criteria for classifica- 
tion are presented below along with apprais@ 
of the answers. In no case did the analysis 8° 
beyond the explicit content of Ss’ statements- 

Responses to the first question were classifie 
by the authors as intropunitive or nonintro- 
Punitive, as this concept is defined by Rose?” 
zweig (12). Intropunitive responses were thos¢ 
in which S expressed doubt about the accuracy 
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E TABLE 3 TABLE 3—Concluded 
ESPONSES IN THE POST-EXPERIMENTAL INTERVIEW 
- Question 5a: 

Question 1: seule you say that you were tempted at 
Would you describe in your own words your imes to answer as the others did? 
experiences during this experiment? Group 

an ee eS S 
Group Response Soc. Soc. & Self 
Response ——— App. Self App. 
Classified as: Soc. Soc. & Self 
App. Self | App. Yes 12 7 5 
& No 2 6 7 
Intropunitive 8 3 5 
Nonintropunitive 6 10 7 r 
Question 6a: 
A you even an as the others did against 
Question 2a: ees 
| When you gave an estimate that disagreed —— Cop 
with the others, did you feel that if the lines Response Soc. Soc & Self 
Fm were measured with a ruler you would turn App. Self App. 
out to be right or wrong? - PP 
Group No i 3 10 
Oa 
Response Soc. Soc. & Self 
App. Self APP. Question 7b: 
Right 10 12 12 Did you wonder what everybody might be 
h Wiongor? 4 1 0 thinking of you when you disagreed? 
Question 2b: Group) 
Then did you feel that if the lines were Response S 
measured, ihe others would turn out to be aoe Se & feat 
wrong? i Pp: š Pp: 
Group xa 10 9 9 
eee ee o 4 3 
Response Soc. Soc. & Self 
à elf App. 
App. = BE Question 7c: 
Wrong 7 g 0 wise did you think the others thought about 
or Right you 
Group 
Question 3: aenea Soc. Soc. & Self 
Would you say that you were concerned ó App. Self App. 
about the disagreements? - 
Gror coos wrong 7 6 1 
Response Soc. Soc. & Self Other 3 3 8 
App. Self App. 
No 5 S z 
Yes or ? 9 5 of her judgment, e.g., “I think when she took 
them away I kept wondering, did I answer 


correctly . . . they were so close sometimes.” 


Question 4a: 
Still thinking back to the time that you oe A response was classified as nonintropunitive 
giving your judgments, would you say that i+ ¢ stated that the group was in error, or that 


doubtful about your 1 1 e 
the others made yon some physical aspect of the situation led to 


acca the disagreements, or she simply reported that 
eo there were disagreements. An example of the 

Response Soc. Soc. & Self atter type of response is, “Everybody was 
i SE A saying the line was A, and I said it was B. No- 

Yes 11 7 5 one else ever varied from their observations, 
3 6 7 but I did.” The content of these examples is 


No 
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typical of those assigned to the two categories, 
but they are unusually brief. : 

The responses to Question 1 of the social and 

self-approval group were more often non- 
intropunitive than were those of the other two 
groups. In response to Questions 2a to 6a, the 
social-approval group differed in the expected 
way from the other two groups. Members of 
that group were: less certain that they were 
right (Q 2a), less willing to state that the 
majority was wrong (Q 2b), more concerned 
about disagreements with the majority (Q 3), 
less certain about the accuracy of their judg- 
ments (Q 4a), more often tempted to answer 
as the others did (Q 5a), and more willing to 
admit that they had answered against their 
own first choice (Q 6a). None of these differ- 
ences is completely convincing when taken 
alone, but in the aggregate they are consistent 
with the hypothesis that members of the 
social-approval group were more concerned 
over their performance relative to that of the 
majority than were members of the other two 
groups. 

The differences that appeared in the inter- 
views do not simply reflect differences in per- 
formance in the experiment proper. If we com- 
Pare the performance Scores of persons who 
said “Yes” and “No” (or “Right” and 
“Wrong”) in reply to a particular question we 
find no differences. This lack of relationship 
between performance and S’s reaction to that 
performance arises, of course, because .S may 
yield uncritically, yield but feel concern, not 
yield and feel concern, or remain independent 
in both performance and attitude. 

The responses to Questions 7b and 7c clearly 
reflect this distinction between behavior and 
the concomitant affect. Data for Q 7b are pre- 
sented to show that awareness of others was 
about equally characteristic of all groups. The 
responses of those who replied “Yes” to Ques- 
tion 7b are analyzed for Question 7c, 

Responses to Question 7c were classified as 
“Something wrong with me” or “Other.” Of 
those who stated that there was “Something 

wrong with me,” nine persons said something 
like, “I thought maybe they were thinking I 
was just crazy”; the other five said, “I thought 
they might be wondering if I was half blind or 
something,” or its equivalent. Members of the 
“Other” group either equivocated, ʻI was just 
wondering what they were thinking,” or, 
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simply accepted the fact, “I thought they 
probably thought I was wrong.” 

The responses to Question 7c provide further 
evidence regarding the difference between the 
social- and self-approval group and the self- 
approval group. The replies of the group high 
on both scales clearly resemble those of the 


social-approval Ss and not at all those of the 
self-approval Ss. 


Discussion 


Figure 1 and Table 2 show that the social- 
approval group did, as predicted, yield to an 
erring majority more frequently than did 
groups of Ss with other motivational profiles. 
The qualitative data, Questions 2a to 6a, are 
also in accord with the prediction. It should be 
emphasized that our prediction was not based 
on categorization of persons as conformers and 
nonconformers. As we have previously indi- 
cated (2, 3, 4), we assume that conformity 
ber se is not a trans-situational trait, but that 
any individual conforms or fails to conform in 
a given situation depending upon his motiva- 
tional profile and the relevance of conformity 
behavior in that situation to his motives. Thus, 
in a situation in which no motive gratification 
is to be expected from conforming, such be- 
havior would not be predicted even where 
there is reason to believe social approval is an 
important motive in the individual’s life. In an 
unpublished study from the authors’ labora- 
tories,5 for example, the following interesting 
results were found. In an Asch judgmental 
situation similar to the one used in this study, 
greater conformity was obtained when the 
majority consisted of a high prestige group 
(college seniors) than when it was composed © 
peers (college freshmen). Even more striking 
was the finding that the presence of a low pres- 
tige majority (high school sophomores) de- 
creased the number of yielding responses an 
increased the number of errors in the directio” 
opposite to that taken by the majority. m 
findings are considered to be entirely consisten 
With the present interpretation of conformity 
behavior as instrumental activity. 

The distribution of conformity scores for 
the social- and self-approval group shows 4 
tendency toward bimodality, but the numbet 


8 This study was conducted by Misses Jane Molloy 
and Barbara Munger as an undergraduate course ' 
search assignment. 
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of cases is too small for this departure from 
unimodality to be significant. Such bimodality 
might suggest that individuals in this group 
choose one resolution or another of the motive 
conflict, i.e., they conformed or did not. The 
qualitative data are quite interesting in this 
regard. 

The answers to Question 1 do little more than 
suggest that the social-and self-approval group 
is in some way different from the others. At 
that stage of the interview, most of the Ss gave 
rather evasive answers which they contradicted 
later when asked about specific aspects of the 
experiment. For instance, five members of the 
self-approval group gave seemingly intro- 
punitive responses to Question 1, but all mem- 
bers of that group stated that they believed 
they had given accurate judgments in response 
to Question 2a. 

The data presented for question 7c in Table 
3 in conjunction with the quantitative meas- 
ures seem to establish the status of the social- 
and self-approval group. It appears that this 
group may have been in conflict, as we ex- 
pected, a conflict which they may have re- 
solved by differentiating between their perfor- 
mance and their evaluation of the majority’s 
reaction to it. We noted the apparent lack of 
relationship between performance and attitude 
previously. 

The studies presenting clear evidence of 
conflict in performance involve specific motor 
acts (7). Evidence of other conflicts, however, 
is typically clinical in nature (13). The implica- 
tion of the literature and of our study seems to 
be that in the study of conflict in complex 
situations simple performance measures must 
be supplemented by other means. It appears 
that the social-approval Ss yielded uncritically, 
or yielded but felt concern; the self-approval Ss 
remained independent in performance and 
attitude; and the social- and self-approval Ss 
did not yield, but felt concern over the major- 
ity’s opinion of their independence. 

The factors discussed above appear to ac- 
count for the similarities and differences be- 
tween the motivational groups. However, the 
final samples were not perfectly matched in all 
respects in which matching was attempted 
(see Table 1). The social- and self-approval 
group actually occupies positions intermediate 
between those of the social-approval group and 
the self-approval group on both scales 3 and 4. 
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This fact may have contributed to the absence 
of more clear-cut differences between the 
social- and self-approval group and the others. 
Second, the scale 2 (security) scores of the 
social-approval group are higher than those of 
the other groups. In order to test the effect of 
scale 2 scores, the 41 cases were divided into 
those above and below the median scale 2 
score for the total sample. The conformity 
scores of the two groups so constructed were 
then compared by means of the U test, with 
the finding that scale 2 scores were not related 
to conformity behavior in our test situation.® 


SUMMARY AND CONCLUSIONS 


Two groups of women’s college students were 
placed in a judgmental conformity situation 
of the sort studied by Asch to test the hypoth- 
esis that persons with high social-approval and 
low self-approval motivation would yield to a 
unanimous, but erring, majority in judgment 
of length of lines more frequently than would 
persons with the opposite motivational profile. 
A third group with high motivation in both 
areas was included to provide information 
about the effects of motivational conflicts in a 
conformity situation. The measures of social 
and self approval were obtained from a four- 
scale measure of motivation, The Behavior 
Interpretation Inventory. 

The results were in accord with the major 
hypothesis. The groups differed as predicted in 
frequency of conformity to the erroneous 
judgments of the majority. Comparison of the 
groups by pairs indicated that the high social- 
approval—low self-approval Ss yielded to the 
majority significantly more often than did 
either of the other groups. The performances 
of the other two groups were similar. 

Qualitative data obtained in postexperimen- 
tal interviews indicated that Ss in the conflict 
group differed from those in the self-approval 
group in their evaluation of the majority’s 
attitude toward their performances. It is sug- 
gested that the high social- and self-approval 
Ss experienced conflict, which they resolved by 
dissociating their performance from their emo- 


tional reactions to it. 


6 We do not mean to imply that concern for security 
(nonsocial) is necessarily unrelated to conformity be- 
havior in all situations, but that this particular situa- 
tion does not appear to have been influenced by this 
motive as measured. 
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ture (1, 2) a number of experiments have 

J examined the relationships between cer- 
tain of his suggested communication structure 
properties and group performance (e.g., 3, 4, 6, 
12). In general, the experiments have employed 
a self-paced task with a single decision re- 
quired and have utilized nets differing in the 
number and pattern of communication links 
between subjects. As Luce et al. have noted, 
“,.. the prototype groups are those formed 
by society to produce accurate information 
Outputs in a short period of time from certain 
input data—information processing groups 
(8, p. 82). , 

The present experiment is an extension of 
several earlier studies (7, 9, 10, 11) which, 
although concerned with information process- 
ing groups, focused on a different characteris- 
tic of communication structure and employed 
a forced-paced task. The different emphases 
reflected an interest in problems of division of 
labor in a slightly different class of information 
handling groups, i.e., those formed to produce 
multiple outputs from certain classes of chang- 
ing input data, where both output and input 
requirements entail some division of responsi- 
bility. One critical variable manipulated when 
allocations of responsibilities are varied is the 
degree of information exchange required be- 
tween group members. The term “communi- 
cation structure” was used to refer to the 
pattern and volume of required intermember 
communications which resulted from the 
nature of the task imposed on the group and 
the way responsibilities were allocated rd 
the group (9). By imposing a task with si, 
and known, display-control relationships = 
manipulating the allocation of observation an 
out under the Air Force 
rch Center, Lackland 
in support of 
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control action responsibilities within the group, 
structures were produced differing in terms of 
the volume and distribution of required intra- 
group communications. 

As expected, structure difficulty decreased 
as the “autonomy” of group members in- 
creased, i.e., each member was the primary 
source of information required for his deci- 
sions, and information not directly available 
was located at a single external source (10, 11). 
That is, group efficiency decreased with anin- 
crease in the required amount of information ex- 
change and with an increase in the amount of in- 
formation exchange involving multiple sources. 
When the information processing demands of 
the task were increased, the relative differences 
between structures were maintained with 
errors for all structures increasing monotoni- 
cally with increases in the amount of informa- 
tion presented the group per unit time. How- 
ever, even under the “easiest” experimental 
conditions—with a low rate of information 
input and low information exchange struc- 
ture—the groups were able to process (without 
error) considerable less information than might 
be expected from a knowledge of the informa- 
mation handling capacity of individuals (7). 

Although the results supported the initial 
general hypothesis that structure-imposed 
demands for intragroup communication would 
be related in a systematic way to group per- 
formance over a range of task imposed de- 
mands, they at the same time raised additional 
issues. Three such interrelated issues moti- 
vated the present experiment. 

1. There was some evidence that learning 
occurred under all of the task and structure 
conditions explored, but it was not possible to 
determine whether the learning curves were 
marked by sharp discontinuities,’ and whether 


2 Group “learning” curves probably reflect the opera- 
tion of two primary factors: acquisition and improv 
ment of relevant part skills of group members and 
improvement in organization or “coordination.” Pre- 
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there were qualitative and/or quantitative 
differences in the curves as a function of task 
input and structure “demands.” 

2. In addition, if improvement in perfor- 
mance occurred, it should be accompanied by 
changes in the communication behavior of 
group members. If so, would the changes in 
communication be a function of the require- 
ments for information exchange imposed by 
structure and input variations, and could the 
performance of groups be related to the ade- 
quacy with which they adapted their commu- 
nication behavior to these imposed “‘de- 
mands”? 

3. Another aspect of the problem of “adap- 
tation” was the question of the effects of group 
operating procedures for exchanging informa- 
tion on performance. As we noted above, the 
information processing capability of the 
groups was lower than might be anticipated 
from data on individual performance. Such a 
discrepancy may have resulted from the 
method of varying information adopted from 
the experiments and the task used. However, 
there was some indirect evidence that the 
primary limiting factor was the inability of 
groups to set up an efficient system for infor- 
mation exchange (7, 11). 

Groups appeared to be using a “soliciting” 
rather than a “volunteering” system: The 
individual who required information called 
the person who was a source and requested the 
information rather than the source person 
calling the user when a change of information 
had occurred. Since the latter Procedure en- 
tails just half the number of messages as the 
former, it might be expected that information 
transmission would be more effective using a 
“volunteering” system. However, volunteered 
information may reach a group member “out 
of phase” with other essential information 
(and would have to be stored) or at a time 


sumably, the form of the learning curve in any partic- 
ular case will depend on the relative contribution of 
part skills and coordination to the group product 
measure. In general, however, the acquisition of part 
skills should produce gradual improvement in eff- 
ciency, while adoption of a new procedure OF organiza- 
tion could produce a more marked change in efficiency. 
Thus, a learning curve based on group product measures 
may be of typical exponential form but with sharp 
changes in slope at various points, the discontinuities 
being associated with the adoption of a new procedure 
or reflecting a reorganization of the group. Data from 
the early experiments had not revealed such discon- 


tinuities, but very few trials had been used. 
, 
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when other activities conflict with attending to 
the message (and may be lost). Thus, despite 
the fact that fewer messages are required, 
volunteering could prove to be a less effective 
procedure than soliciting. Data on the relative 
efficiency of volunteering and soliciting under 
a range of input and structure conditions 
would help clarify the issue. 

The present study then, is an extension of 
previous experiments and is designed to throw 
some light on the issues raised above. Since a 
scarcity of data bearing on group learning or 
communication is available in situations of the 
sort considered here, no attempt is made to 
advance or test specific hypotheses. Rather, 
the focus is on obtaining objective measures 
that are extensive and reliable, in order to 
shed some light on the relationships between 
task and structure “demands” and group 
performance. 


METHOD 
Task 


The task used in this study was essentially that de- 
scribed in previous reports (7, 10). There were three 
major components, the displays, the controls, and the 
communication system. 

The displays were 6 AC voltmeters on each of which 
an aircraft “instrument” dial was simulated, The dial 
readings were programmed by simple resistance switch- 
ing accomplished at a central console, Each instrument 
could be set at one of three positions. The controls 
were 6 two-position switches which were also labele 
to simulate aircraft controls. Associated with each 
switch was an “operating procedure” card which indi- 
cated the correct switch settings for the various values 
of instrument readings. Correct settings for eac 
switch were determined by the readings on a particular 
pair of instruments, 

Three member groups were used. Each subject (5) 
was assigned to a three-sided booth in which was 
Placed two simulated aircraft instruments, two control 
switches, anda “communication signalling” box (Fig. 1). 
Direct communication was prevented, The Ss coull 
communicate (over a standard 6-volt interphone circu” 
with conventional headsets and microphones) by dee 
Pressing switches which provided communication e 
nections between pairs of booths. The signalling devi¢ 
in conjunction with an operations recorder providen 
an automatic and continuous record of “who called 
whom” and the length of time spent in each comm 
cation. The settings of each of the six control switches 
were also recorded on a 20-pen Esterline-Angus opet® 
tions recorder, east 

The task of the group was to adjust and readjust 
each of the six controls to their proper settings in Te 
Sponse to periodically changing readings on the instru- 
ments. The instruments in each booth contained only 
part, or none, of the information required for adjusting 
the controls in that booth, the additional information 


h 2 eee 
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Fic 1. SKETCH OF APPARATUS A 
1-6 = SWITCHES; IG 


or both of the other booths. Each 
as both a control agent and an 
n with responsibility for ad- 
ing and relaying instru- 


being located in one 
S, then, functioned 
information source perso! 
justing two controls and detect 
ment reading changes to other Ss. 


Experimental Conditions 

Structure. Two structure conditions were employed. 
In the Low Autonomy Structure (LA) each S had two 
controls for which none of the four necessary instrument 
readings was directly available. In High Autonomy 
Structure (HA) each S had two controls for which all 
but one of the requisite instrument readings were 
available in his booth. Thus, in the LA structure, 


each dent upon the other group members 
oo eer oe r while in the HA struc- 


for all his required information, : 
ture each S was dependent upon external sources for 
only one reading. The interdependencies between 
group members represented by a matrix schema is 

shown i i 
ie ek load was defined as the rate = 
change of the instrument readings. The input to the 
group consisted of the readings on each of six instru- 
ments. At a fixed interval the readings on one of the 
six instruments changed requiring the adjustment of 
cither one or two controls. A trial consisted of 18 such 
changes, three for each instrument. The sequence of 
j-random, subject 


instrument reading changes was quasi : 
to the restriction that there be three changes for eac 


ND EXPERIMENTAL ARRANGEMENT: I 1-6 = INSTRUMENTS; S 
's = Instruction Carns; C.B.’s = 


CALL Boxes 


instrument. The number of control adjustments that 
each of the three Ss had to make in response to the 18 
changes was equalized. 

Three fixed intervals between instrument reading 
changes were employed: 6 seconds, 10 seconds, and 
15 seconds. Since the number of changes was constant 
for a trial, this necessitated having trials of unequal 
length for the three different “loads.” 

Operating procedure. As noted above, one comparison 
of interest in this study was between a “volunteering” 
and “soliciting” procedure for information exchange. 
Complete control over the procedure used by the groups 
was unfeasible, but preliminary experience indicated 
that a large measure of control could be obtained by 
differential instructions. The initial instructions to 
groups assigned to the volunteering procedure stressed 
the role of the S as an information source. The Ss were 
told who needed the information they had, and were 
asked to report any changes in their readings to the 
group members who required the information. In con- 
trast, the instructions to “soliciting” groups empha- 
sized the responsibilities of each S as a control agent. 
The Ss were told which other group members had the 
information required for correctly adjusting their con- 
trols and were asked to call these Ss to obtain the 
information. In both cases, several practice trials were 
performed under experimenters’ supervision in which 
the appropriate procedure was again stressed. 


—_— 
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TABLE 1 


INFORMATION TRANSMISSION REQUIREMENTS IN THE 
Two EXPERIMENTAL STRUCTURES* 


Source 
Group of Information 
Member A B Cc 


High Autonomy Structure (HA) 


User of information A 3 1 0 
B 0 3 1 
Cc 1 0 3 

Low Autonomy Structure (LA) 
User of information A 0 1 S 
B 3 0 1 
Gc 1 3 0 


* The entries indicate the items of information (instrument 
readings) held by the group member indicated in the column and 
required by the individual indicated in the row. Entries in the 
diagonal indicate items directly available. 


Subjecis and Design 


The Ss were basic trainees at Lackland Air Force 
Base. Six Ss reported for testing each day; they were 
divided fortuitously into two teams of three Ss each 
for morning and afternoon testing. They were brought 
into the experimental room, seated in their booths, and 
oriented as to the general nature of the task and the 
use of the interphone equipment. Following this, they 
were given a brief practice session to insure that they 
understood how to interpret the instrument readings 
and the operating instruction cards. They then per- 
formed the task under one of the combinations of the 
experimental conditions. 

There were three experimental variables: Structure, 
Operating Procedure, and Load, with two conditions 
of structure and procedure and three levels of load. 
A 2 X 2 X 3 factorial design was used with four groups 
randomly assigned to each cell. Each group performed 
nine trials, each trial consisting of 18 instrument 
reading changes. After each set of three trials there was 
a short two- to three-minute rest period during which 
Ss remained in the booths. 


RESULTS? 


The scores used in the analysis are the num- 
ber of errors in control settings made by the 
group within each of the nine trials. A change 
in an instrument reading in any of the three 
booths required either one or two control 
adjustments; if the correct control settings 
were not made before the next change in an 
instrument reading an error was recorded for 
each incorrect setting. 

Table 2 presents the error scores for the 


3 Acknowledgment is made for the able assistance of 
Mr ‘Edward Amberg both in experimental administra- 


tion and in the statistical analysis reported. 


experimental conditions and trials. The 
analysis of variance of these data is shown in 
Table 3. 

Congruent with the results of earlier studies, 
structure and load are significant. Errors in- 
creased as the rate of change of information 
increased and were greater for the LA struc- 
ture. The difference between the “volunteer- 
ing” and “soliciting” procedures is not signifi- 
cant, although in the expected direction (lower 
errors for “volunteering”), 

The load X structure interaction is signifi- 
cant when tested against error (b) but not 
when tested against the significant second 
order interaction—load X structure X pro- 
cedure. In general, the HA structure was 
superior to the LA structure, the difference 
being most pronounced for the 10-second rate 
of change. However, the relative superiority 
of HA over LA was also a function of the pro- 
cedure used; a volunteering procedure resulted 
in a greater difference between HA and LA 
for the 6-second rate of change, while the con- 
verse was true for the 15-second rate of change. 

Although the six conditions of structure and 
load cannot unequivocally be ordered in terms 
of difficulty, there is little doubt that the 
6”-LA, and 15”-HA conditions, respectively, 
define the most and least difficult ends of the 
continuum. Thus, volunteering appears to be 
more effective than soliciting for intermediate 
difficulty levels. 

Other sources of significant variance are 
those due to trials and the interactions 0 
trials X load and trials X structure. Examina- 
tion of the over-all means in Table 3 indicates 
that there was a monotonic decrease in error 
Scores as a function of practice. Inspection © 
Fig. 2 and Fig. 3 indicates, however, that the 
regression of errors on trials was a function 0 
both load and structure. The rate of improv 
ment increased as the rate of change of infor- 
mation decreased, and was greater for the 3 
structure and the LA structure. There a 
Some evidence of a performance asymptote ‘ Š 
the 15-second trend and in the trends for t 
LA and HA means. he 

Analysis of communication records. From E 
20-pen Operations Recorder record of t 
“calls” between group members, three ae, 
related measures were obtained for each ee 
The number of messages initiated by all thre 
of the group members, the average length © 
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TABLE 2 


MEAN Errors FOR EXPERIMENTAL CONDITIONS AND TRIALS: ENTRIES ARE THE MEAN OF ERRORS FOR 
Four Groups 


| | Trials 
Procedure Structure Load | 
| 1 2 3 4 5 6 4 8 9 
Volunteering LA 6” 49| 49| 44| 47| 49| 46| 46| 52 47 | 429 
| 10” 55| 38| 42| 41| 38 39| 38| 34| 29| 354 
| 15” | 36] 33] 26} 20] 18) 20 19| 18| 23| 213 
140 | 120 | 112 | 108 | 105 | 105 | 103 | 104 | 99 | 996 
HA | 6” 46 | 42 43 37 39 | 34] 30] 23| 34] 328 
10” 30 21 21 11 15 13 11 10 7 139 
| 15” 36 25 21 17 21 20 16 16| 15| 187 
| 112| 88| 85! 65| 75| 67| 57| 49| 56| 654 
Soliciting LA 6” 49| 48) 48| 47| 41| 44| 48| 49) 4 415 
| 10” 53| 49| 45| 46] 39] 37| 36] 42 40 | 387 
15” 31| 38| 33| 34| 30] 31] 30| 27 26 | 290 
143 | 135 | 136 | 127 | 110 | 112 | 114 | 118 | 107 1092 
HA 6" 47| 46] 40| 39| 39| 36| 35] 35| 35] 352 
10” 36 | 30} 23 23 17 17 16 17 15 194 
15” 23 17 15 15 10 12 11 8 9 120 
106| 93| 78| 77| 66| 65| 62| 60| 59 666 
TABLE 3 messages was largely restricted to reports of 
SUMMARY OF ANALYSIS OF VARIANCE OF ERROR SCORES instrument readings. 
3 Sums of | Mean z As with the error scores we were interested in 
S F Ratio ý T A S 
TESNETS 4f | Squares | Squares -differences in these measures of communication 
Total 431 | 86,705. behavior as a function of experimental condi- 
g tions, as well as changes as a function of 
A. Between groups 2 as aoe aige practice. In addition, however, it was of 
Structure 1 | 21/832, | 21,832. |122.65°* interest to know whether there were differences 
Procedure 1 434. 434. a between “good” and “poor” performance 
oe 2 E oe pes groups, within an experimental treatment. 
s a P 1 253. 253. | 1.42 For the latter objective, the four groups 
LXSXP 2 | 2,245. iag: 6.30 within a treatment were dichotomized on the 
B. ae ieee | basis of mean error scores over the nine trials 
Trials 8 | 9,168. | 1,146. eer and “performance” was used as the fifth 
Tr wy a ss. | 1-8) classification variable in the analyses of vari- 
TXP 8 370. 4 | 144 ance. The summary tables of the analyses of 
TXLXS 16 ee ra an variance for number of messages, average 
ao eo Es ia 12. "33 length of messages, and total talking time, 
TXRLXSXP 16 480. | 30. .%& are presented in Tables 4, 5, and 6. 
error (w) 288 | 9,169. 32. Number of messages. For the “number of 
messages” measure, load, structure, and 


* Significant at the .05 level. 
** Significant at the .01 level. 


messages, and the total time spent by all 
three group members in communication. It 
was expected that any change in communica- 
tion behavior would be in terms of the fre- 
quency and duration of messages rather than 
in message content, since the content of 


trials contributed significant variance. The 
number of messages increased as the rate of 
change of information decreased, was greater 
for the LA structure, and increased monotoni- 
cally with practice. There were fewer messages 
with the volunteering procedure than with the 
soliciting procedure, although the difference 
was not significant. 
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MEAN ERRORS 


TRIALS 
Fic. 2. Mean GROUP Error SCORES OVER 
TRIALS FOR THE HA AND LA Groups 


With respect to the result for the load vari- 
able, however, it was apparent that although 
the number of calls per trial increased as the 
rate of change of information decreased, the 
increase was proportional to the length of the 
trial. When corrected for length of trial, the 
number of messages per unit time for the three 
rates of change was approximately equal. The 
rate of message transmission appeared to be 
independent of the rate of change of input 
information; the differences between loads is 
attributable solely to the length of the trial. 

Average length of messages. Table 5 shows 
that significant sources of variance for this 
measure are structure, performance, trials, 
and the trials X load X structure interaction. 
The average length of messages was longer for 
the Jow autonomy structure, shorter for the 
“better” performing groups, and decreased 
with practice. T he changes with practice, how- 
ever, were a function of both load and struc- 
ture. For the LA structure the decrease was 
largest for the 10-second load and about equal 
for the 6- and 15-second loads, while for the 


HA structure the largest decrease was for the 
15-second load with the 6-second and 10- 
second loads following in that order. The 
amount of decrease, however, was correlated 
with the average length of messages on trial 1 
and very little difference in mean message time 
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was evident on trial 9 or in mean message time 
over the nine trials. The interaction may thus 


be reflecting a “catching up” phenomena which 
was accomplished by about the fourth trial. 
The surprising feature of the results was the 


time attributable to load. Since the amount of 
information to be transmitted per trial was 
constant for the three-load conditions, one 
might expect that decreasing the time for com- 


lack of significant variance in mean message | 


munication would have decreased the average 4 
length of the communications, but this did not 
occur. Apparently, as the volume of informa- 
tion to be transmitted increased (due to struc- 
ture variations) the average length of messages 
increased, but there was no change in average 


length with changes in the amount of informa- 
tion to be transmitted per unit time (due to 


load). 


MEAN ERRORS 


Total talking time. As Table 6 indicates, load, 


TRIALS 


Fic. 3. Mean Group Error Scores OVER 
TRIALS FOR THE THREE Loan CONDITIONS 
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structure, procedure, trials and the inter- 
actions—load X structure, and trials X 
structure—contributed significant variance to 
total talking time. Total talking time increased 
with a decrease in the rate of change of infor- 
mation, was greater for the LA structure than 
the HA structure, and was less for the volun- 
teering than the soliciting procedure. Also, the 
difference between LA and HA structures, on 
this measure, increased as the rate of change 
of information decreased, e.g., the difference 
between LA and HA was greater for the 15- 
second load and the 6-second load. 

As with the “number of messages” measure, 
when we corrected total talking time for the 
length of trials there were no differences be- 
tween loads. It seems that whereas the volume 
of information to be relayed (structure) and 
the procedure used for routing information 


TABLE 4 
SUMMARY OF ANALYSIS OF VARIANCE OF 
“NUMBER OF COMMUNICATIONS” 


Source of Variance at | SUBS SE | souares |” Bat 
Total 431 | 170,917. 
A. Between groups 47 | 141,898. | 3,019. 
1. Load 2| 68,790. | 34,395. |23.33°° 
2. Structure 1| 12,392. | 12,492. | 8.47°° 
3. Procedure 1] 3,918. | 3,918. | 2.66°° 
4, Performance 1| 1,306. | 1,306. - 
1X2 2| 2,314. | 1,157. — 
1x3 2| 1,888. 944. — 
1x4 2| 2,276. | 1,138. ~ 
2 x 3 1 5,132. | 5,132 3.48 
2X4 1 65. 65. — 
3x4 1 6 6. — 
1X2X3 2| 3,340. | 1,670. |1413 
1X2X4 2 906. 453. - 
1X3X4 2| 1,499. 750. - 
2x34 1 643. 643. — 
Error (b) including quad. | 26 | 38,323. 1,474. - 
interaction l a x 
ithi 5i 3 . j. 
= ae oy g| 9,542. | 1,193. |22.94°°e 
TX1 16] 1,090. 68. | 1-31 
TX2 8 434. s4. |1.04 
TX3 8 364. 46. | — 
TXE 8 159. 2. | — 
TX1X2 16 829. 52. | 1.00 
TX1X3 16] 1,092. 68. | 1.31 
TX1X4 16 359. 2. — 
TX2X3 8 805. 101. | 1.94 
TX2xX4 8 134. 17. - 
TX3xX4 8 394. 49. | — 
Error (w) including higher |264 13,817. 52. 


— F less than 1. 

* Significant at .05 Jevel. 
** Significant at -01 level. 

»»» Significant at .001 level. 
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TABLE 5 
SUMMARY OF ANALYSIS OF VARIANCE OF “AVERAGE 
LENGTH OF COMMUNICATION” 


Source of Variation | dy | Sumsof | Mean. |F Ratio 
Total 431 1,709,733. - = 
A, Between groups 47 |1,364,473.| 29,031. — 

1. Load 2 950. 475. - 

2. Structure 1 | 749,750.| 749,750. 56.99°** 

3. Procedure 1 17,088.| 17,088. | 1.30 

4. Performance 1 92,138.| 92,138, 7.00" 
1x2 2] 14,995.| 7,498. | — 

1X3 2 31,525.| 15,762. | 1.20 

1X4 2 14,541.] 7,270. — 

2X3 1 1,576.| 1,876. | — 
2x4 1 4,300.1 4,300. | — 
3x4 1 9,343.| 9,343.) — 
1X2X3 2 11,389.| 5,694. — 
1X2X4 2 3,837.) 1,918. = 
2X3X4 1 15,588.| 15,588. | 1.18 
1X3X4 2 55,424.| 27,712. | 2.11 
Error (b) including quad. |26| 342,029. 13,155. 
interaction 
B. Within groups 384 | 345,260. 899. 
Trials 8| 73,434.| 9,179. 13.66%°* 
TX1 16 8,312. 520. - 
TX2 8 3,079. 385. - 
TX3 8 9,234. 1,154. | 1.72 
TX4 8 3,862. 483... | — 
TX1X2 16 28,980.| 1,811. | 2.69°* 
TXxX1x3 16 16,531.} 1,033. | 1.54 
TX1x4 16 13,541. 846. | 1.26 
TX2xX3 8 1,764. 220. — 
TX2X4 8 4,863. 608. = 
TX3X4 8 4,289. 536. — 
Error (w) including higher/264 | 177,371. 672. - 
interactions 
— F less than 1. 


* Significant at the .05 level. 
Significant at the .01 level. 
Significant at the .001 level. 


influenced the amount of time spent in com- 
munication, the rate of change of information 
in general, did not. However, when increased 
time was available for communication (de- 
creased load), relatively more of it was utilized 
for sending messages in the high “demand” 
structure (LA) than in the low “demand” 
structure (HA). 

The means and F ratio for trials indicate 
that total talking time increased with practice. 
The increase was greater for the LA structure 
than the HA structure. 


Discussion 


As might be expected, the results cast some 
light on the issues which gave rise to the study, 
but, at the same time, generate additional 
questions. The analysis of error scores sup- 
ports the general hypothesis that “demands” 
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TABLE 6 
SUMMARY OF ANALYSIS OF VARIANCE OF 
“ToraL TALKING TIME” 


Source of Variation | ay | Sunset | Mean. | P Ratio 
Total 431 | 585,627. | — - 
yeen groups 47 | 537,455. | 11,435 — 
A seo i 2 | 255,420. | 127,710. | 87.89°%°° 
Structure 1 | 181,507. | 181,507. | 124.92°°° 
Procedure 1} 8,348. | 8,348.) 5.74° 
Performance 1| 1,846. f 1,846. | 1.27 
1x2 2| 39,580. | 19,790. | 13.62°* 
1x3 | 2 675. 338. { — 
1X4 2| 2,778. 1,389. — 
2X3 1f 430. 430. — 
2x4 1| 2,766. | 2,766. 1.90 
3x4 1 263. 263. — 
i 2S 2| 1,526. 763. — 
1X24 2 801. 400. - 
1X3X4 2| 3,374. | 1,678 1.16 
2X3xX4 1 360. 360. — 
Error (b) including quad.| 26 | 37,781. | 1,453. — 
interaction 
B. Within groups 384 | 48,172. 125. — 
Trials 8| 5,488. 686. | 6.53°* 
Tit 16| 1,176. 74. — 
TXI 8| 6,072. | 759. | 7.22%% 
TX3 8 396, | s| — 
TX4 8 603. | 75. — 
TX1X2 116] 1,722.1 108. | 1.03 
TX1X3 |16) 2,380. | 149. | 1.41 
TX1X4 16 762. 48. = 
TX2x3 8 459. S7: — 
TX2X4 8 385. 48. — 
TX3X4 8| 1,108. 138. | 1.31 
Error (w) including |264| 27621. 105. - 
higher interactions 


— F less than 1. 

* Significant at the .05 level. 

** Significant at the .01 level, 
*** Significant at the .001 level, 


imposed by either the group’s structure or 
task will affect mean performance level as well 
as the rate of error reduction. Increasing 
the communication interdependency between 
group members and/or increasing the rate of 
information input, increases the average num- 
ber of errors and depresses the rate of improve- 
ment with practice. In addition, over-all 
group performance appears to be differentially 
effected by the procedure used for routing in- 
formation, depending upon both structure and 
load. A volunteering procedure is better than a 
soliciting system only for the intermediate 
levels of “demand” imposed by structure and 
input variations. 

The chief influence of structure and task 
demands, however, appears to be in setting 
the range over which learning occurs. The 
form of the curves for all experimental condi- 
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tions is generally quite similar; the parameters 
appear different. For none of the conditions is 
there evidence of discontinuities in the trends. 
Since the task appears to afford opportunity 
for rapid improvement in performance if 
optimum operating procedures are adopted, 
the lack of discontinuities may indicate that 
“insightful” reorganization did not occur, even 
under relatively low demand conditions. 

Such an inference is in accord with the re- 
sults obtained in several communication net 
studies. The net employed in the present study 
is, in essence, an all-channel or totally con- 
nected net with varying degrees of channel 
usage imposed by structure variations. 
Christie eż al. (4) found some evidence to sup- 
Port a quasi-discontinuous notion of group 
learning in many of the nets they investigated, 
but little evidence of such a phenomena in a 
totally connected network. Guetzkow and 
Simon (5) demonstrated that groups in all- 
channel nets take relatively longer to achieve 
a stable and efficient organizational procedure 
than groups in wheel nets. Presumably as a 
consequence, the all-channel groups produce 
more continuous acquisition curves, while the 
wheel groups show rapid improvement in per- 
formance in the early trials and very little 
subsequent improvement. 

The Guetzkow and Simon findings suggest 
that discontinuities may occur in the learning 
curve under conditions which facilitate the 
emergence of “optimum” operating procedures. 
In their study, the restrictions in communi- 
cation which reduced the number of open 
channels, created an asymmetric position, aM 
decreased the number of necessary relays, 
accomplished this. In the present experiment, 
elimination of unnecessary channels (possible 
in the HA Structure) or restrictions on the 
relative availability of channels could possibly 
assist the development of efficient procedures; 
and if these Procedures emerged, discontinu- 
ities in the learning curve might result. 

In general, the analysis of communication 
measures does not reveal as clear-cut effects 

* It is possible, of course, that real progress in UD 
covering the factors influencing group learning W! 
depend upon more detailed analysis of the component 
Processes involved. In the present study error scores 
are composed of contributions from several “process 
sources (e.g., translating instructions, detecting instru- 
ment reading changes, transmitting information) 2% 


Pa ae Se 
well as several individuals, and errors from the 
various sources may smooth upon summation. 


PN, 


Group LEARNING AND COMMUNICATION 


for the experimental conditions as does the 
error data. Surprisingly, the rate of change of 
input (load) effects no significant changes in 
communication behavior, although a longer 
total time available for communication in- 
creases the number of messages transmittea 
and the total time spent in communicating. 
Since the amount of information to be trans- 
mitted per trial is equal for all the loads it 
would be of interest to know what function is 
served by the larger number of messages under 
decreased load. The most plausible hypothesis 
is that they are redundant messages used for 
checking.” 

Structure variation, which is coordinate 
with the volume of information to be trans- 
mitted, produced differences in all three 
measured aspects of communication behavior. 
In the LA structure more messages are sent, 
their average length is greater, and the total 
time spent in communicating is longer than 
in the HA structure. The differences are to be 
expected in view of the larger volume of 
information which had to be transmitted in the 
LA structure; however, the relative differences 
(approximately 134:1) are not proportional to 
the differences in the required volume of 
information to be transmitted imposed by the 
two structures (4:1). Thus, relative to the 
demands imposed by the structure, the 
volume of transmitted information is much 
greater in the HA structure than in the LA 
structure. Since in both structures the amount 
of transmitted material was greater than that 
required by the task, it is probable that the 
differences reflect a much larger proportion of 
redundant messages in the HA structure than 
in the LA structure. 

The difference is similar to that obtained for 
task load and appears to indicate that as 
demands decrease groups utilize the “free” 
time for checking activities. In view of the 
performance differences between load and 
structure conditions, this suggests that the 
primary determinant of performance adequacy 
was the degree of redundancy permitted by 
task and structure demands. The assumption 
that “free” time was employed in checking 
activities may also account for the lack of 
development of more efficient operating pro- 
cedures. If, under the continued press of task 
all available time is devoted to 


requirements, 
relevant, reassessment 


activities directly task- 
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of over-all procedure is unlikely. If true, the 
provision of a break in the task during which 
group members could freely discuss their 
operation may be the most general method to 
facilitate the development of more efficient 
operating procedures. 

With respect to the changes in communica- 
tion behavior over trials, it is seen that with 
increasing practice more messages are trans- 
mitted, their average length decreases, and 
the proportion of the trial interval used for 
communication increases. Apparently, groups 
learn to “pack” more communication into a 
trial interval, but the rate of learning is rela- 
tively uniform. The change in “packing” 
efficiency could reflect the operation of several 
factors: the adoption of a coding procedure, 
the elimination of irrelevant material from 
messages, or increased efficiency in performing 
the observation and control action functions 
allowing more attention to be directed to 
communication. Whatever the source, there is 
some evidence that the more effective groups 
“pack” more efficiently than the less effective 
groups (they transmit shorter messages and 
use a larger proportion of the trial interval for 
communication). 

Two aspects of these results are perplexing. 
One of the consistent findings of the com- 
munication net studies has been a decrease in 
the number of messages transmitted with 
increasing practice. Yet, in this study, com- 
munication behavior changes have as their 
common consequence an increased flow of 
information, even though in early trials the 
number of messages is much larger than re- 
quired by the task. Several existing differences 
between the net studies and the present 
experiment may possibly account for the 
discrepancy. 

The net studies employ a self-paced task 
with information from multiple sources to be 
screened for a single decision. The use of 
message cards obviates the need for storage of 
the transmitted material on the part of the Ss. 
The S’s problem, therefore, is essentially in 
selecting targets for messages, and as efficiency 
in selection increases, the number of messages 
required for solution should decrease. The 
present study uses a forced-paced task (the 
periodic input to the group in no way depends 
on antecedent decisions) with multiple informa- 
tion inputs and multiple required decisions. 
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The information is transmitted verbally and 
no permanent record of messages is provided. 
Since every decision is dependent on collated 
information, a given message must reach S 
when he is free to receive it and relatively in- 
phase with other required information. The 
choice of a recipient being a trivial problem, 
the primary difficulty appears to be one of 
“phasing.” 

Rationally, it would seem that several 
procedures could reduce errors due to phasing 
difficulties: (a) advantage could be taken of 
predictable features of the input and the group 
could phase its information transmission with 
the predictable order of change in the informa- 
tion categories; (b) the source of information 
could initially transmit information once and 
repeat when requested; (c) the source could 
transmit repeated messages, thus allowing a 
recipient to sample this semicontinuous flow 
at his discretion. 

Procedure (a) should lead to decreased in- 
formation flow which is not consistent with 
the present results or with results obtained in 
a previous study which employed higher 
degrees of input predictability (7). Procedure 
(b) should result initially in greater volume 
of transmitted information than theoretically 
required although the flow should decrease 
with practice. Since communication differences 
obtained between the volunteering and solicit- 
ing procedures indicate that a strict volunteer- 
ing procedure is not maintained, groups appar- 
ently did employ a procedure like (b). It is 
unlikely, however, that (b) alone could account 
for the increased information flow. Groups 
probably also reacted to the phasing difficulties 
by utilizing a technique such as (c). 

The second perplexing aspect of the com- 
munication results is the apparent indepen- 
dence of the rate of change of communication 
behavior and the demands imposed by struc- 
ture and task load. It seems justifiable to 
assume that environmental demands would 
be coordinate with drive strength and, thus, 
the greater the demands the more rapid the 
learning. Possibly, due to some process of social 
facilitation, the motivation for laboratory 
groups is higher (or lower) and less variable 
than is typical with individual Ss and the 

influence of external factors is attenuated. At 
any rate, further research on the relationships 
between the rate of modification of important 
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instrumental behaviors and group motivation 
and environmental demands is indicated. 

The apparent independence of the rate of 
change of communication behavior and de- 
mands raises another more specific problem. 
Since the rate of error reduction is a function of 
demands, we are left in the peculiar position 
of trying to account for different rates of error 
reduction in terms of nondifferential rates of 
acquisition of important part-skills. One 
parsimonious explanation is that a given level 
of improvement in communication behavior 
has different consequences for error reduction 
in the various conditions. As the amount of 
information to be transmitted per unit time 
decreases, a given increase in communication 
efficiency could result in a proportionally 
greater decrease in errors. However, a precise 
statement of the relationship between rate of 
error reduction, demands of the task, and rate 
of improvement in communication behavior is 
impossible in the present study, and possibly 
in most studies of group learning. Methodo- 
logically, the result highlights the difficulty of 
accounting for group learning curves, based 
on traditionally employed error or time criteria, 
in terms of important part-processes. Similar 
difficulties have been encountered by other 
investigators (4, 8). 

In conclusion, it should be pointed out that 
the usual strictures against overgeneralizatio? 
apply quite literally in the present study- The 
experimental situation investigated was con- 
siderably more restricted than most gt0UP 
laboratory situations and the task problem 
was more structured than problems typica ly 
faced by either “teal” groups or other tem- 
porary groups. It is interesting to note how” 
ever, that despite such restrictions, ™UC 
of the “explanation” of the data hinges °” 
retrospective analysis of task requirements 
Until a framework for the analysis of ee 
task is available, hypotheses concerning 
relationships between group structure ae 
process, task demands, and group performan ‘i 
are going to be difficult to formulate and eve 
more difficult to test. 


SUMMARY , 
arliet 


The present study was an extension of € 
experiments on the relationships between ae 
munication structure and group performance 
an information processing task. The focus w 
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on changes in error performance and certain 
aspects of communication behavior as a 
function of practice under several levels of 
“demand” imposed by the group’s communica- 
tion structure and the rate of change of in- 
formation input to the group. The analysis of 
results indicated: 

1. Mean error performance and learning 
rate were a function of both communication 
structure and input load with better per- 
formance and faster rates for low demand 
conditions. 

2. There was no evidence of discontinuities 
in the learning curves for any of the conditions 
and by inference no evidence that “insightful” 
reorganization occurred, even under low 
demand conditions. 

3. All three aspects of communication 
behavior measured—average number of mes- 
sages transmitted, average length of messages, 
and proportion of the trial interval spent in 
communication—varied with communication 
structure, but only the last varied with input 
load. There were changes in all three aspects 
with practice: more messages were transmitted, 
their average length decreased, and a greater 
proportion of trial time was spent in com- 
munication. Groups appear to learn to “pack” 
more information in a fixed interval with 
practice. However, there were no differences 
between conditions in the rate of change of 


communication behavior. 
hemselves under con- 


Apparently, left to t l à 
stant situational pressures, groups modify their 
behavior at a characteristic rate which is 
independent of the demands placed on them. 
However, depending on the nature of the 
task, a given rate of behavior modification 
may have different consequences for the rate of 


improvement in some performance criterion 


such as errors. 
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CRITIQUE AND NOTES 


A FURTHER STUDY OF THE EFFECT OF STRESS ON PALMAR PRINTS 


ROBERT H. DAVIS 
The RAND Corporation 


N a recent publication, Mowrer (3) has de- 
I scribed a method for obtaining palmar prints. 
Presumably densitometer measurements of such 
prints vary directly with emotional upset and re- 
laxation. The terms emotional upset and relaxa- 
tion are obviously difficult to tie down, but ina 
recent study (1) palmar prints were taken during 
periods which were “temporally proximal” _to 
what are generally agreed to be stress inducing 
situations, oral examinations for the doctorate. 
Since the study reported by Beam was design- 
ed primarily to investigate the effect of real-life 
stress on conditioning and learning, the results 
with respect to palmar sweat under stress and non- 
stress conditions were not reported directly. It was 
demonstrated, however, that stress could be di- 
rectly related to trials to reach a criterion of mas- 

tery, and furthermore, that the greater the in- 
crements in palmar Sweating, the greater the 
increments in trials, 

The present study was designed to test directly 
the hypothesis that real-life stress can be related 
to the index of palmar sweat. All measurements 
were taken during a period of stress rather than 
during a temporally adjacent period as in the study 
of Beam, Briefly, the palmar prints of a group of 
34 college students were taken during an exami- 


1 The study described was conducted while the 


author was a member of the faculty of Allegheny Col- 
lege, Meadville, Pa. 


nation and contrasted with control prints of the 
same Ss which were taken cither two weeks prior 
to the examination or two weeks after the exami- 
nation. The method utilized for taking the palmar 
prints was that described by Mowrer (3). Quan- 
titative values were assigned to the prints by 
averaging five Kodak densitometer readings, one 
on each of the four fingers and the thumb. Rater 
reliability was assessed, using the rank-correlation 
coefficient. A Separate coefficient was calculated 
for the experimental and control prints. Their 
values were .81 and .83 respectively. A nonpara- 
metric test was applied to the data. When palmar 
prints taken under the two conditions were con- 
trasted using the sign test (2), densitometer values 
of the examination prints exceeded those of the 
control prints in 27 out of 34 cases (p < .01), 
lending further support to the hypothesis that 


palmar prints can be meaningfully related to con- 
ditions of real-life stress. 
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SITUATIONAL DIFFERENCES IN LEADER BEHAVIOR? 


A. PAUL 


HARE 


Harvard U; niversity 


TR types of leaders have been contrasted in 
the child study literature since the early 
1930’s, one group-oriented, the other, self-orient- 
ed (1, 6, 10, 11). In the present Tesearch, sys- 
tematic observations of boys of each type are 


1 Paper read at the annual meeting of the American 
Sociological Society, September, 1955. The Present Te- 

rch is part of a continuing study of the emotional 
An lopment of children which has been undertaken 
a last five years by members of the Harvard 
Scho l if Public Health and the Human Relations Sery- 
- eee The author is indebted to Erich Linde- 
ice 
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made on the school playground, where these i 
terns have been observed in the past, and in t 


more intimate, less competitive situation of the 
back yard. 


mann, the project sponsor, for his continuing supports 
to Donald Klein, Elizabeth Lindemann, and Ann Ros 
for their collaboration in developing the research design, 
and to Barbara Buckstein for carrying out half one 
field observation. The research was completed while 
the author was a Public Health Research Fellow of the 
National Institute of Mental Health. 


SITUATIONAL DIFFERENCES IN LEADER BEHAVIOR 


The definitions of group and self-oriented leaders 
are derived from three previous studies by Gruber 
(7), Lindemann and Ross (9), and Vaughan and 
Faber (12) who use the same population of children 
from which the present sample is drawn. The 
group-oriented leader can be described as one who 
leads on the basis of self-confidence, has a clear 
perception of the requirements of the situation, 
and can play a follower’s role. The self-oriented 
leader is anxious, aggressive, and has a driving 
need to be in the center of the group at all times. 
When these general definitions are described in 
terms of Bales’ categories (2), the group-oriented 
leader would be expected to be higher than the 
self-oriented leader in the positive and neutral acts 
of categories 1 through 9 and lower in the negative 
acts of categories 10, 11, and 12 (see Table 1). 

Previous studies have not indicated the nature 
of the input or the average behavior of other group 
members directed toward the leader. However, 
Lindemann and Ross have observed that each 
leader type tends to attract followers of the same 
type. One would, therefore, expect the input and 
output profiles to be similar for each leader style? 

Since most of the normative studies of child 
behavior are based on observations made in the 
school setting, situational differences in behavior 
are not reported. Barker and Wright have noted, 
however, that “stable, non-psychological situations 
such as Sunday school, day school, etc. coerce the 
children to behave in relatively homogeneous ways 
regardless of the individual characteristics of the 
children” (4). 

Although the observations of boys on the school 
playground in the present study are in part a 
validation of the clinical judgment of certain 
teachers and social scientists, the more important 
objective of these observations is to delineate in 
detail the behavior of boys in two different natural 


settings. 
METHOD 
ted leaders and six self-oriented 


leaders were selected from the third grades of four New 
England public schools on the basis of a teacher’s rating 
and ratings by two previous observers of the boys.* 
The teachers were first asked to identify the boys in 
their classes who were leaders on the playground, sec- 

ether the boys tended to be group- 


ond, to indicate wh s 
oriented or self-oriented, and third, to rank the boys 
of each type. From the teacher’s list, boys were selected 


who had also appeared to be group- or self-oriented in 


revious research by the author (8), two leader 
d by boy leaders leading other boys 
in discussion groups. Although the major distinction 
between the two types of leaders was the amount of 
participation in the discussion, the results indicate that 
boys with different leader styles can produce significant 
differences in the interaction of their peers. 

3 Elizabeth Lindemann and Ann Ross had observed 
these children in the school over a period of three years. 


Six group-orien 


2Inp 
styles were impose! 
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their behavior in the research of the previous investi- 
gators. 

z During the winter and spring of 1954 these twelve 
nine-year-old boys were observed on the playground 
during a free play period and in the neighborhood, 
using the set of categories for interaction process anal- 
ysis developed by Bales (2). (See Table 1.) The field 
observation technique differed from that of Barker (3) 
in that (a) interaction was scored rather than taken 
down verbatim, and (b) individual activity apart from 
the group was not scored. An observer scored only one 
boy at a time, recording all of his acts directed to others 
and all acts directed to him by other group members. 
When possible, each of two observers made half of the 
observations on each subject in each situation. Although 
the interobserver reliability approached but did not 
meet the standard suggested by Bales (2) at the begin- 
ning of the research, it was well above the standard by 
the time the last boy was observed.‘ 

The average observation period on the school play- 
ground was four sessions for a total of 47 minutes (149 
acts) and in the neighborhood was four sessions for a 
total of 75 minutes (218 acts). Interaction with teachers, 
parents, or observers was not recorded; only the inter- 
action with other children was included in the profiles. 
Nonverbal interaction called for by the rules of the 


game was also omitted. 
Bales’ categories were adapted for field use by re- 


moving the restriction that category 4, “Gives sugges- 
tion,” implies autonomy for the other. 


RESULTS 


The average percentage of the acts in each cat- 
egory by leader type and situation given in Tables 
1 and 2 is obtained by converting the individual 
percentages by the arc sine transformation, com- 
puting the means, and reconverting to percentages. 
In the analysis of the data, however, the arc sine 
values are used. 

An analysis of variance for each category of acts 
for the output data (Table 1) (acts directed by the 
leader to other group members) indicates that the 
group-oriented leaders give significantly more sug- 
gestions than the self-oriented leaders.’ They also 
show less tension, a difference which approaches 


significance. 

No significant differences appear in the input 
(acts received by the leader from the other mem- 
bers) between the two leader types (Table 2). 


4 When samples of mutually observed acts were com- 
pared, the reliability measure computed at the begin- 
ning of the research was 5.40 which exceeds the limit of 
5.3 for 6 degrees of freedom suggested by Bales. By 
the end of the research the statistic was reduced to 2.74 
where the maximum suggested is 6.3 for 7 degrees of 
freedom. 

5 The differences considered significant in this re- 
saerch have the probability of <.05 of occurring by 
chance. Probabilities of <.10 are referred to as ap- 
proaching the significance level. A “one-tail” test is 
used only when the direction of the differences is pre- 


dicted. 
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TABLE 1 
MEAN PERCENTAGE OF INTERACTION OUTPUT FROM 
GROUP- AND SELF-ORIENTED Boy LEADERS 
FOR PLAYGROUND AND NEIGHBORHOOD 


Playground Neighborhood 

Interaction Category G.0. | s.0. | G.0. | §.0. 
Ldrs. | Ldrs. | Ldrs. | Lars. 

1. Shows solidarity 5.7 4.3 2.8 4.3 
2. Shows tension release 6.4 8.5 7.8 6.9 
3. Shows agreement 3.9 2.1 4.3 3.8 
4, Gives suggestion 23.7 15.9 20.6 11.5 
5. Gives opinion 7.2 7.3 6.5 8.0 
6. Gives information 17.9 19.2 31.3 32.2 
7. Asks for information i:s 1.3 6.9 10.8 
8. Asks for opinion 0.0 0.0 0.6 0.1 
9. Asks for suggestion 0.0 0.0 0.1 0.0 
10. Shows disagreement 1.4 3.1 4.3 3.2 
11. Shows tension 4.3 10.5 3.0 4.5 
12. Shows antagonism 20.1 6.1 10.5 


* Average number of acts per boy is 149 on the playground 
and 218 in the neighborhood. 

t The raw percentage is converted by the arc sine transforma- 
tion for calculation and reconverted for the table. For this reason 


the sum of percentages for each column does not equal exactly 
100. 


TABLE 2 
MEAN PERCENTAGES OF Interaction Input TO 
GROUP- AND SELF-ORIENTED BOY LEADERS 
FOR PLAYGROUND AND NEIGHBORHOOD* 


Playground Neighborhood 

Interaction Category [——_———_—_—_—_ 
G.-0. | S.-0. | G.-0. | s.-0. 
Ldrs. | Ldrs. | Ldrs. | Lars. 
1. Shows solidarity 8.4 8.5 5.4 3.8 
2. Shows tension release 0.8 2.3 4.1 4.9 
3. Shows agreement 7.8 7.8 6.3 3.4 
4, Gives suggestion 12.4 10.0 16.0 14.1 
5. Gives opinion 7.6 8.5 7.2 7.3 
6. Gives information 8.7 14.7 32.4 32.3 
7. Asks for information 2.7 2.1 10.6 8.6 
8. Asks for opinion 0.0 0.0 1a 0.2 
9. Asks for suggestion 0.0 0.2 0.1 0.1 
10. Shows disagreement 3:3 5.7 4.4 4.8 
11. Shows tension 0.8 1.0 0.6 1.4 
12. Shows antagonism 24.0 24.3 47 13.4 


* Average number of acts per boy is 55 on the playground and 
142 in the neighborhood. 


Some additional differences between the two 
leader styles appear in only one of the two situa- 
tions. The self-oriented leaders give and receive 
more antagonism than the group-oriented leaders 
in the neighborhood, a difference which approaches 
significance for output (Table 1) and is significant 
for input (Table 2). The self-oriented leaders also 
receive significantly less agreement in the neighbor- 
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hood and receive more information on the play- 
ground (a difference which approaches significance) 
(Table 2). 

More significant differences in output appear 
between the two situations for the twelve leaders 
(Table 1). Giving and asking for information are 
both greater in the neighborhood and showing 
antagonism is greater on the playground. Differ- 
ences in giving suggestion, disagreement, and 
showing tension approach significance with more 
suggestions and tension on the playground and 
more disagreement in the neighborhood for both 
leader types. 

Some of the same differences between situations 
also appear in the input data (Table 2). Signifi- 
cantly more information is given and asked for in 
the neighborhood and more antagonism is shown 
on the playground. 

The consistency of the behavior of each leader 
is analyzed by comparing his rank order in each 
category (a) for output between situations, (b) for 
input between situations, (c) for output and input 
on the playground, and (d) for output and input 
in the neighborhood. Only 54 of the 80 correlations 
thus generated are positive and of these only 11 
are statistically significant, i.e., greater than .78 
(correlations for categories 8 and 9 were not com- 
puted). This reflects a general lack of consistency 
in the observed behavior, less than that reported 
for adult subjects by Borgatta and Bales (5). 


SUMMARY AND CONCLUSIONS 


Six group-oriented and six self-oriented boy 
leaders are observed on the school playground and 
in the neighborhood, using Bales’ categories for 
interaction process analysis. The clinical judgments 
of teachers and social scientists are not entirely 
validated since boys of both leader types show the 
Same amount of aggression on the school play- 
ground, although the self-oriented leader is more 
aggressive in the neighborhood. Other expected 
differences appear since the self-oriented leader 
gives fewer Suggestions and shows more tension in 
both situations. 

It is possible that the group-oriented leaders are 
Perceived as less aggressive on the school play- 
ground because they have a more legitimate reason 
for aggression in the interest of group control. The 
self-oriented leader uses his aggression to een 
his central position so that he is usually describe 
as “more aggressive,” although both types vi 
the same number of aggressive acts on the schoo 
playground. 

The major finding of this research is that there 
are more differences between situations than be- 
tween the two leader styles. For the twelve aie 
year-old boy leaders in this sample, giving an 
asking for information and disagreement are higher 
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in the neighborhood, and giving suggestions, show- 
ing tension, and showing antagonism are higher 
on the playground. Differences in input are in the 
same direction, with more information given and 
asked for in the neighborhood, and more antago- 
nism shown on the playground. 

The higher amount of information in the neigh- 
borhood probably results from the fact that the 
children spend more time making up games, while 
the increase of suggestion, tension, and antagonism 
on the playground reflects a more competitive 
situation in which the individual must assert him- 
self to maintain his status in a large group com- 
posed of children his own age. 
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ECENT research hi 
of psychological 
lates of physiological 


as pointed to a number 

and sociological corre- 

reactions during acute 
stress. In terms of cardiovascular response we 
have demonstrated a positive association be- 
tween anger directed outward and a nor-epineph- 
rine-like cardiovascular pattern and between anger 
directed inward or anxiety and an epinephrine-like 
pattern (3). We have also shown that a subject s 
perception of his father as stern and dominant in 
discipline was associated with a nor-epinephrine- 
like cardiovascular reaction while perception of 
father as mild and nondominant in discipline was 
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RACTICE AND CARDIOVASCULAR REACTIONS DURING STRESS! 


DANIEL H. FUNKENSTEIN 
Harvard Medical School 


associated with an epinephrine-like pattern (6). 
This report describes a third correlate of cardio- 
vascular reactions in acute stress, that of religous 


practice and attitudes. 


METHOD 


The data on which these findings are based were 
gathered in a two-year study of healthy college students 
during acute laboratory stress. The details of this study 
have been reported elsewhere (2, 3) but in brief the 
procedure was as follows. One hundred twenty-five 
randomly selected male students were placed in lab- 
oratory stress situations where they were given arith- 
metical problems to do in their heads while being 
hurried and criticized by the experimenter, or they were 
asked to repeat a story from memory while talking into 
an auditory feedback device. Cardiovascular measure- 
ments by means of a blood pressure cuff and a ballisto- 
cardiograph were obtained before and after each session 
and the Ss were interviewed at the end of the stress 
situation for a measure of their emotional reactions. In 
addition, each S was seen by the Zs on other occasions 
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where a battery of psychological tests and a biographi- 
cal questionnaire were administered. 

On the basis of relative changes between the pre- 

stress and stress measures of blood pressure and ballisto- 
cardiograph tracings the Ss could be divided into two 
main groups comparable to those described earlier by 
Funkenstein and Meade (4). In their study blood pres- 
sure and ballistocardiograph tracings were taken from 
normal Ss during the infusion of nor-epinephrine and 
during the infusion of epinephrine. The various physio- 
logical measures fell into two distinct Patterns under 
each condition, characterized by changes in pulse, mean 
IJ wave on the ballistocardiograph, and mean blood 
pressure. The exact procedure for the determination of 
these two patterns can be found elsewhere (4, 5). The 
pattern of physiological variables in one group of Ss in 
the present study was like that found by Funkenstein 
and Meade during the infusion of nor-epinephrine and 
is thus termed nor-epinephrine-like. The other group 
had a pattern of variables comparable to that found 
during the infusion of epinephrine, and we have termed 
it epinephrine-like. 

Data on religious practice and attitudes were ob- 
tained from the personal history questionnaire and the 
Religious Conventionalism scale of Levinson. 

On the questionnaire each S was asked about the 
churchgoing behavior of his parents through the follow- 
ing questions: 

1. How frequently does your father attend church? 
a. Frequently; b. occasionally; c. seldom. 

2. How frequently does your mother attend church? 
a. Frequently; b. occasionally; c. seldom. 

Responses to these questions were combined into a 


single measure of parental church attendance, as fol- 
Ows: 


son has described this particular scale as follows (7): 
The Religious Conventionalism (RC) scale contains 
a variety of ideas about the church, the Bible, God, 
prayer, and the like. High scores reflect a conception of 
God as punishing power figure and of the church as 
absolute moral authority, as well as a marked emphasis 
on faith, tradition, and conformity to institutional 
forms. Low scores, on the other hand, represent a Te- 
ligious humanistic or a nontheistic approach which 
emphasizes reason, personally derived values, and a 
naturalistic rather than supernaturalistic view of the 
world. x 
The S is presented with 12 statements in the area 
of religious conventionalism and asked to indicate the 
degree of agreement or disagreement with each state- 
ment on a seven-point scale ranging from strongly agree 
to strongly disagree. He can receive a score of 1-7 on 
each statement or a range of 12-84 on the total. 
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TABLE 1 
PARENTAL CHURCH ATTENDANCE AND CARDIOVASCULAR 
REACTIONS OF SuBJEcTS DURING STRESS 


Cardiovascular Pattern of Subject 
Parental Church 


Attendance Nor- Total 
epinephrine-like Epinephrine-like 

Regular 20 11 31 
Occasional 13 19 32 
Infrequent 3 12 15 

Total 36 42 78 

X? = 8.719. 

P< 02. 


TABLE 2 


Scores or THE CARDIOVASCULAR Reaction Groups 
ON THE RELIGIOUS ConvEeNTIONALISM SCALE 


Religious 


Cardiovascular Reaction Group Conventionalism Scale 


N Mean SD 


Nor-epinephrine-like 36 


42.94 
Epinephrine-like 41* 


33.63 


19.00 
18.79 


* One subject became unavailable for testing after filling out the 


Personal history questionnaire and before taking the RC scale. 
b= 2.13. 


$ < 05. 


REsULTs 


The association between the measure of parental 
church attendance and the cardiovascular reactions 
of the Ss in stress can be seen in Table 1. The Ss 
whose parents were regular attenders were more 
inclined to have nor-epinephrine-like cardiovascu- 
lar reactions while those whose parents were kea 
casional or infrequent attenders had epinephrine- 
like patterns. This difference was especially striking 
between Ss with regularly attending parents and 
those with infrequently attending parents. The 
chi square in this table was 8.719 which with 2 
degrees of freedom is significant at the .02 level of 
confidence. 

The mean RC scale scores for the nor-epineph- 
Tine-like and epinephrine-like groups are shown in 
Table 2. It will be noted that even though the SD 
was large there was still a significant difference be- 
tween the means with the use of a £ test. Those 
with N patterns had higher RC scale scores than 
those with E patterns, 


Discussion 


These data when combined with those on which 
we have reported Previously indicate that there n 
a constellation of psychological and sociologica 
factors which are associated with the cardiovascu- 
lar reactions of healthy Ss in acute stress. ae 
constellation includes the S’s immediate emotional 


i 
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reaction, his attitudes in the area of religious 
values, his perception of parental behavior in dis- 
cipline, and the churchgoing behavior of his 
parents. The Ss who responded to acute stress 
with a nor-epinephrine-like cardiovascular reaction 
tended to also respond with anger directed at the 
E. In addition, they were individuals who per- 
ceived their father as the principal disciplinarian 
in the family and stern in his discipline, they had 
conservative attitudes in the area of religious faith 
and belief, and they came from families where the 
church attendance habits were regular. On the 
other hand, those Ss who responded to acute stress 
with an epinephrine-like reaction tended to respond 
with anger directed against themselves or anxiety. 
In addition, they perceived their fathers as non- 
dominant in discipline, had moderate to liberal 
attitudes in the area of religious faith and belief, 
and came from families where the pattern of church 
attendance was occasional or infrequent. 

These factors come from a number of levels, not 
only the immediate biological (physiological and 
emotional), but the more enduring organization of 
perceptions (perception of father and attitudes 
toward religion), and even the surrounding family 
milieu (church attendance of parents). 

We leave it to future research to spell out the 
manifold implications of these associations. We do 
suggest that the relationships are of sufficient 


strength to encourage further interdisciplinary re- 

search among the fields of physiology, psychology 

and sociology. 
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CERTAINTY OF JUDGMENT AND RESISTANCE TO SOCIAL INFLUENCE! 


HAROLD H. KELLEY 
University of Minnesota 


anD THOMAS W. LAMB 


Harvard Medical School 


EVERAL studies have indicated that social in- 

fluences have greater effect on judgments the 
more ambiguous is the stimulus being judged (2, 
6). One explanation of this finding is that under 
conditions of ambiguity, individuals are less cer- 
tain about their judgments and, hence, more 
easily influenced. The factor of certainty has been 
studied independently of stimulus ambiguity by 
Hochbaum (4) who found high confidence in an 
opinion to be associated with high resistance to 
influence. The present note presents further evi- 


1 Conducted while the authors were at Yale Uni- 
versity, under the auspices of the Yale Communication 
Research program. The authors are indebted to several 
chemists at Yale and Minnesota for their advice. 


» 


dence on the importance of certainty, obtained 
under conditions where there are marked individ- 
ual differences in reactions to a given stimulus 
and associated differences in certainty of judg- 
ment. 

® The method involved the use of phenylthiourea 
(also known as phenylthiocarbamide), hereafter 
referred to as PTU. This substance is tasteless to 
some individuals but tastes extremely bitter to 
others. After preliminary work indicated that we 
could present PTU in a fairly controlled manner 
and readily distinguish tasters from nontasters,? 


2 Blakeslee (1) has found that tasting behaves asa 
dominant gene and nontasting as a recessive gene. The 
basis for this difference is now thought to exist in the 
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we used it to study the following question: Does 
the effect of a majority depend upon the particu- 
lar opinion they advocate, relative to the opinion 
held by the minority? For example, will the effect 
of the majority’s espousal of “A” when the minor- 
ity initially favors “B” be the same as if the ma- 
jority had supported “B” and the minority, “AM? 
That this problem readily lends itself to the use 
of PTU is apparent: We have merely to form 
groups composed of unequal numbers of tasters 
and nontasters, in some of which the tasters are 
in the majority and in others of which the non- 
tasters predominate. 


METHOD 


A rating form was prepared with three scales of 
“pleasantness of taste,” ranging from 0 (bad) to 10 
(good). The scales were labeled A, B, and C, respec- 
tively, to correspond to three gummed labels, also 
labeled A, B, and C, which were attached to the bottom 
of the forms. Label A was a plain label, and Label B 


interference of PTU with the activity of certain en- 
zymes (7, Volume I, p. 337). It is also thought to be a 
goiterogenic compound if administered on a long-term 
basis, as it inhibits the formation of thyroxine, a hor- 
mone of the thyroid gland (7, Volume I, p. 427). It 
has been used as a food Preservative, but recently in 
this connection the question of its possible carcinogenic 
action has been raised (5), 
PTU can be obtained in a ure form fro: 
Eastman Kodak Compan: oe 


many chemical supply hi 


a minute quantity of it. 
The solution was prepared by diluting to twice its 
volume a saturated solution of PTU in Baker Extract 
Company’s “True Fruit Cherry Flavor” Which hes a 
16% alcohol base. It was then applied with an atomizer 
to the back of Reyburn’s label No. 13 until a deep 
purple color was attained. This rather crude method of 
“dosage” was found to produce quite standard results, 
because of the high concentration of PTU that was 
used. Individual differences in reaction were noted, 
but of the Ss to whom we presented the labels, not one 
exhibited a violent reaction, yet all the “tasters” ex- 
perienced an intense, bitter taste. 
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was the same kind of label coated with pineapple 
flavor. Label C was prepared with a combined coating 
of PTU and cherry flavoring. Scales B and C each had 
an accompanying scale for degree of certainty, ranging 
from 1 (not at all certain) to 7 (very certain). 

The subjects (Ss), Yale University upperclassmen, 
were first seen individually and privately. Each was 
told that we were trying to find out “whether there 
are absolute values for these tastes or whether 
there are individual differences in ratings of them.” 
He was then given a form, asked to taste Label A, and 
instructed to give it a rating of 5 (average) on the 
scale of pleasantness. He was then asked to taste B 
and C, and to rate each one in relation to A. After 
rating B, he was asked to rate his certainty that B 
should be given the pleasantness rating he had given 
it. A similar procedure was followed for C. 

On the basis of these private judgments, the Ss 
were scheduled in 30 three-man groups, half composed 
of one taster and two nontasters and half, of two 
tasters and one nontaster. Because the population of 
Ss yielded a ratio of tasters to nontasters of approxi- 
mately 3:1, about 150 individuals were privately 
tested before the 30 groups could be arranged. In 
scheduling each group, the further requirement was 
made that each of the three men should know the 
other two, the result being that men who lived near 
each other in a given residential college were placed 
in the same group. 

The groups were assembled one to three days after 
the individual testing. The instructions were similar 
to those used earlier, except that certainty ratings 
were not requested and the Ss were asked to announce 
their ratings publicly, each one first giving his rating 
of B, and then of C. Each time, the majority persons, 
whether tasters or nontasters, were asked to announce 
their ratings first. When it came his turn on B, the 
minority person usually found that the others ni 
given ratings similar to his private ones. But on ¢ 
he found that their ratings were markedly differen 
from his own evaluation, 


RESULTS 


_ The data were analyzed only for the 90 Ss used 
in the group portion of the experiment. 

1. The tasters and nontasters agreed very 
closely in their ratings of Label B, the one coate 
with pineapple flavoring. On the average, «che: 
gave it a rating near 7.0 on both the initial private 
ratings and the publicly announced ones. We 
mention this to show that the tasters and non- 
tasters are comparable in their reactions to 4 
substance not containing PTU and that they use 
the rating scale in much the same manner. é 

2. The tasters’ initial private ratings of C, ih 
experimental label coated with PTU and cherry 

voring, were markedly lower than the ratings 
made by the nontasters. The mean rating for the 
45 tasters was 0.8 and for the 45 nontasters, 6 4 
A £ test of this difference yields a value of 32. 
which is obviously significant. The two distribu- 
tions of ratings do not overlap at all. The range © 
ratings given by the tasters is from 0 to 2; for the 
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nontasters, from 4 to 8, with 42 of the 45 giving 
ratings from 6 to 8. 

3. The public judgments of Label C were first 
examined for the Ss in each of the 30 groups who 
announced their ratings first. These ratings, when 
compared with the private ones made by the same 
30. persons several days earlier, indicate the sta- 
bility, of these taste judgments in the absence of 
experienced disagreement. The scatterplot of pri- 
vate versus public (early vs. late) ratings reveals 
two separate clusters of Ss, the tasters and the 
nontasters. The product-moment correlation be- 
tween the early and late ratings is +.98. Put in 
other terms, the data showed that 19 of the 30 Ss 
gave identical ratings on both occasions, 10 gave 
ratings which differ by one point, and 1 gave 
ratings differing by two points. The results were 
virtually identical for the Ss (also members of the 
majorities) who announced their ratings second. 
It is quite clear, then, that in the absence of pres- 
sure to change, the private ratings were highly 
predictive of the later public ones. 

4, The public ratings of C were next examined 
with respect to the changes from the earlier pri- 
vate ratings exhibited by the Ss who announced 
their judgments last. Since these Ss were always 
in the minority, these changes indicate the effects 
of the majority opinions. There is a marked differ- 
ence between the minorities composed of tasters 
and those composed of nontasters. The average 
amount of change for the tasters was 0.0, whereas 
the average for the nontasters was —2.6, the 
negative sign indicating that they made a lower 
(less “pleasant”) rating in the group situation. 
There is clearly a significant difference between 
the two kinds of Ss in their tendency to go along 
with the majority opinion. Only 4 of the 15 tasters 
shifted in the indicated direction (toward a higher 
rating) while 13 of the 15 nontasters shifted to- 
ward a lower rating (with Yates’ correction, chi 
square = 8.7, p = (004). It appears, then, that a 
taster majority has a greater effect on a nontaster 
minority than a nontaster majority has on a taster 
minority. 


Two possible bases for this difference are sug- 


gested by our data, only the second of which 
seems to be important. First, it happened that 
the average discrepancy between the ratings an- 
nounced by the majority members and the private 
rating previously given by the minority member 
was somewhat larger in the groups where the 
tasters were in the minority. While this difference 
between the two kinds of groups approaches sig- 
nificance (p = approx. .06), we doubt that it 
accounts for the observed effect. It seems unlikely 
that the minority tasters would have changed less 
simply because the majority members’ opinions 
were somewhat farther away from their private 
ones. In fact, within each sample, there seems to 
be the opposite tendency: greater change tends 


to be associated with a larger discrepancy. If we 
compare Ss for whom the discrepancy was identi- 
cal in size, there is still a marked tendency for 
nontasters to exhibit more change than tasters. 
Second, tasters and nontasters were found to 
differ in how certain they were about their ratings 
of the pleasantness of Label C. The average cer- 
tainty rating for the tasters was significantly 
higher than that for the nontasters (6.6 as com- 
pared with 5.4; £ = 6.25, p < .001). This differ- 
ence was also significant, at the .01 level, when 
only the Ss used as minority members were com- 
pared. This result suggests that the reason for the 
tasters’ great resistance to majority influence may 
be found in the greater intensity of their opinions 
about PTU. The correlation between change and 
certainty within each sample is equivocal on this 
point because the range of both variables is se- 
verely restricted when we consider only the tasters 
or nontasters. We may note parenthetically that 
for both Labels B and C, certainty shows the 
typical U-shaped relation to extremity of rating 
on pleasantness. The closer a person rated the 
flavor to the extreme ends of the scale, the more 
certain he was of the correctness of his rating. 
The eta coefficient is highly significant in both 


cases. 
SUMMARY 


In a simple social influence problem involving 
judgments of taste with PTU the critical stimulus, 
it was found that the effects of majority opinion 
were markedly different depending upon whether 
the minority persons were tasters or nontasters 
of PTU. The data suggest that this asymmetric 
effect, whereby nontasters are more susceptible to 
majority influence, may be attributable to the 
stronger reactions tasters have to PTU and the 
resulting greater certainty they have about their 
judgments of the substance. 
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TWO KINDS OF ASSUMED SIMILARITY BETWEEN OPPOSITES! 
IVAN D. STEINER AND COLIN G. McDIARMID 


University of Illinois 


N a series of recent studies, F. E., Fiedler (3, 4, 
I 5) has shown that a leader’s “assumed simi- 
larity between opposites” (ASo) is an important 
correlate of his team’s efficiency. Basketball teams 
and surveying teams were shown to be more effi- 
cient when the  sociometrically most-chosen 
member attributed very different psycholozical 
traits to persons he regarded as good and bai co- 
workers. The efficiency of bomber and tank crews 
was also found to be related to the leader’s ASo, 
This paper reports an investigation of some statis- 
tical and psychological properties of ASo. 

ASo is a measure of the magnitude of the differ- 
ences between two profiles. One profile represents 
the individual’s ratings of a good co-worker on 
twenty or more six-step graphic rating scales. The 
ends of each scale are identified by polar terms 
such as “energetic” and “tired.” The second profile 
consists of the individual’s ratings of a poor co- 
worker on the same graphic rating scales. The ASo 
score is obtained by summing the squares of the 
distances separating these two Profiles from one 
another (i.e., the distances Separating the ratings 
given to the good co-worker from the ratings given 
to the bad co-worker). Thus: 


N 
ASo = 3 (¢ - 0)? 


when (g — b) equals the discrepancy between the 
ratings given to the good and the bad co-workers 
on one of the graphic rating scales, and N equals 
the number of rating scales employed. The ASo 
Score may be broken into two component parts (2): 


AS 

T= Me ~ Mi)? + 02, 
where M is the mean of the several ratings given 
— M,)? component is the 


» We Can view this com- 

ponent as reflecting the individual ’s tendency to 
regard one co-worker as endowed with more desir- 
able attributes than the other, The second com- 
ponent measures the degree to which 
profiies have different shapes, regardiess of their 
average elevations. This component may be viewed 
as reflecting the individual’s tendency to regard 
his two co-workers as possessing different combina- 
tions of good and bad traits. 


the two 


1 This research was partially supported by a grant 
from the University of Illinois Research Board. 
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There is no logical necessity for these two com- 
ponents of the ASo score to be correlated. It is 
entirely possible for a small or zero average eleva- 
tion difference to occur along with a large difference 
in profile shapes. And a large average elevation 
difference could occur with a small difference in 
profile shapes. The ASo Score is, therefore, a gross 
measure which combines two logically independent 
measures. The purpose of this research is to deter- 
mine whether the two components of ASo are also 
empirically independent of one another, and, if so, 
to determine whether the two components corre- 
late differently with other measures. The other 
measures employed in this research include scores 
on the California F scale (1), and scores obtained 
from self-ratings. Reasons for expecting scores on 
the F scale to correlate with one or both com- 
ponents of ASo will be discussed later, 


Metnop 


Forty-two members of an undergraduate class in 
psychology served as subjects (Ss) for this study, but 
because of omissions from the data, the Ns involved 
in the reported results range from 38 to 42. The two 


Sexes were approximately equally represented in the 
usable samples. 


ASo Scores 


Ss responded to Fiedler’s instrument consisting of 
20 graphic rating scales, the ends of which were identi- 
fied by polar terms, As is customary with this instru- 
ment, Ss rated themselves, a good co-worker, and a ba 
co-worker on each of the 20 scales. The Ss were aske 
to think of someone with whom they had been able to 
work very well on some occasion in the past, and © 
someone with whom they had had difficulty working- 
Each S’s ASo Score was computed by obtaining the 


sum of the squared distances separating his ratings 
these two co-workers, 


Elevation Difference Scores 


In order to compute the (M, — M+) component of 
ASo it was necessary to obtain judgments concerning 
which end of each Scale was regarded by S as the ae 
end. For this purpose each S rated himself on ieam, 
instrument two additional times, once giving bige 
the benefit of any doubts whatever, and once mor 
giving himself the benefit of any doubts. Inspection a 
these two sets of self-ratings permitted a determinatio? 
of the good-bad direction of the scale for the S. Raine 
at the extreme bad end of a scale were then assigne 
a value of one and those at the good end were assigne 
a value of 6. The mean value of the ratings given to 
the good co-worker was obtained, as was the mean 
value of ratings given to the bad co-worker. The differ- 
ence between these two means is the S’s Elevation 
Difference score. 


Two KINDS or ASSUMED SIMILARITY BETWEEN OPPOSITES 


Profile Dissimilarity Scores 


Each S's Profile Dissimilarity score was obtained by 
subtracting his squared Elevation Difference score 


. ASo 
from his score. The Profile Dissimilarity score is 


the square root of the difference obtained in this man- 
ner. 


Self-Ratings 

As noted above, each S rated himself three times: 
once attempting to be as accurate as possible (realistic 
self-ratings), once giving himself the benefit of any 
doubts (optimistic self-ratings), and once without giv- 
ing himself the benefit of any doubts (pessimistic self- 
ratings). The mean and standard deviation of these 
three sets of self-ratings were computed for each S. 

F Scale. A month after the previous data had been 
obtained each S responded to the F scale. Items were 
Scored in the standard manner.? 


RESULTS 


As should be expected, both Elevation Differ- 
ence scores and Profile Dissimilarity scores were 
found to be positively correlated with ASo scores. 
The product-moment correlations are .75 and .53 
respectively. With a sample of 42 persons, both 
correlations are significantly different from zero at 
the .01 level. The correlation between Elevation 
Difference scores and Profile Dissimilarity scores 
is —,03. The two components of ASo appear to be 
empirically independent of one another. 

Relationships between each of the three mea- 
sures and other variables are indicated by the 
product-moment correlations shown in Table 1. 
Three different patterns of relationships are dis- 
cernible. F scores correlate significantly with ASo 
scores, but correlations with the two components 
of ASo are low and do not reach statistical signifi- 
cance. Standard deviations of self-ratings do not 
ficantly with either ASo scores or 


correlate signi 
but do correlate sig- 


Elevation Difference scores, t 
nificantly with Profile Dissimilarity scores. F] inally, 
the mean of the individual’s optimistic self-ratings 
(i.e., the mean of the self-ratings made by S when 
he was instructed to give himself the benefit of any 

ignificantly with either 


doubts) fails to correlate signif 
ASo scores or Profile Dissimilarity scores, but does 


correlate significantly with Elevation Difference 
scores. (Neither the mean “realistic” self-rating 
scores nor the mean “pessimistic” self-rating scores 
correlated significantly with ASo or either of its 
components.) 

These findings thus indicate that the two com- 
ponents of ASo sometimes correlate in different 
degrees (and possibly in different directions) with 


2 Form 45 of the F scale (1, pp- 255-257) was em- 
ployed in this research, but item number 22 was deleted 
because it had become obsolete since the time the scale 


Was published. 
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TABLE 1 


CORRELATION OF ASO AND Its Two COMPONENTS TO 
OTHER VARIABLES 


F Mean of| Sigma of | Mean of 
E Scale Self- | Self- | Optimistic 
ratings | ratings |Self-ratings 
N 
38 42 42 42 
ASo scores —.43°*] .02 29 2 
Elevation Difference | —.21 +18 —.07 35° 
Scores 
Profile Dissimilarity | —.29 | —.09 62°" | —.10 
Scores 


* Reaches the .05 level. 
** Reaches the .01 level. 


other variables. The components may also combine 
to produce ASo scores which correlate more highly 
with other variables than do either of the com- 
ponents taken separately. 


Discussion 


The ASo score, according to these results, is a 
gross measure that combines two uncorrelated 
variables. One of these variables is the perceived 
discrepancy between the overall “goodness” of the 
traits possessed by two co-workers; the other vari- 
able is the perceived dissimilarity between the 
patterns of traits possessed by two co-workers. 
Each of these two variables contributes substan- 
tially to the ASo score, but two Ss may obtain the 
same ASo score even though their scores on the 
two components of ASo are very different. For 
example, two Ss in the present study obtained ASo 
scores of 166; for one S the (M, — M»)? component 
was responsible for seven per cent of the ASo 
score, whereas for the other S this component was 
responsible for 82 per cent. For most purposes it 
would probably be unrealistic to treat these two 
persons as though their ASo scores were equivalent. 

When ASo scores are related to an external 
variable, a significant correlation may be obtained 
because either or both of the components of ASo 
are correlated with the external variable. Or, an 
insignificant correlation may be obtained because 
neither component is correlated with the external 
variable, or because both are correlated but in 
opposite directions. Since it is quite possible for 
one or both components to correlate much more 
highly with an external variable than does ASo 
itself, it is recommended that studies of “assumed 
similarity between opposites” deal with the com- 
ponents as well as with ASo. 

In this research the direction of the relation- 
ships between F scale scores and the two compo- 
nents of ASo was not predicted in advance, but 
significant relationships were expected. According 
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to Adorno et al. (1), the high F scorer is reluctant 
to see differences between members of his in- 
groups; all members are good, whereas people in 
out-groups are bad. On the other hand, the low F 
scorer is described as willing to see various combi- 
nations of traits in members of his in-group, and to 
minimize differences between members of in-groups 
and out-groups. Consequently, if one assumes that 
both the good and the bad co-workers are regarded 
by the S as members of his in-group, both com- 
ponents of ASo should be small for high F scorers 
and large for low F scorers. But, if the poor co- 
worker is regarded as a member of an out-group, 
both components of ASo should be larger for the 
high F scorer than for the low F scorer. The findings 
reported in Table 1 suggest that when people are 
asked to think of someone with whom they have 
worked poorly, they tend to think of an in-group 
member. 

There are two possible interpretations of the .62 
correlation between Profile Dissimilarity scores 
and the standard deviations of Ss’ realistic self- 
ratings. Perhaps a response set leads some people 
to give ratings that are highly variable from item 
to item, and leads other 
that are rather constant 
these response sets operated both when S rated 
himself and when he tated his good and bad co- 
workers, a correlation between Profile Dissimilarity 
scores and the standard 


shows that when standard deviations of Ss? opti- 
mistic and pessim 
and added to th 


realistic self-ratings, the correlation between the 


mechanical” response set 
and suggests a second explanation, Possibly some 
Persons really do see themselves and other people 


as the Possessors of a mixture of good and bad 


do not provide a fi 
: i. rm 
basis for choice between these two explanations 


However, whatever the proper explanation may 
be, it is clear that Profile Dissimilarity scores 
represent a variable and potentially significant 
component of ASo, and that this component is 
empirically independent of the Elevation Differ- 
ence component. 


5. FIEDLER, F. E. 


Ivan D. STEINER AND CoLIN G. MCDIARMID 


The findings of this study suggest that Fiedler’s 
highly productive research might be made even 
more valuable if ASo scores were broken into 
their component parts. Such a step might also be 
helpful in clarifying the reasons why the ASo of a 
leader is related to his group’s efficiency. Fiedler’s 
explanation of his results rests upon assumptions 
concerning the personalities of high and low ASo 
leaders. For example, the leader with a large ASo 
score is said to be “critical, reserved, and rejecting, 
and especially so toward men who he considers to 
be poor co-workers.” He is Presumed to discrim- 
inate sharply between good and bad co-workers, 
to resist emotional involvement, and to be guided 
by task-oriented attitudes. These assumptions 
concerning the personal qualities of leaders with 
large ASo scores have not yet been tested, and 
Fiedler himself has indicated that he regards them 
as tentative. In light of the present study it is 
reasonable to guess that leaders with identical ASo 
Scores may have quite different Personal qualities 
depending upon the size of the components of their 
ASo scores. Somewhat more complex assumptions 
concerning the relationships between ASo and 
personality variables would seem to be needed. 


SuMMARY 


The conventional measure of assumed similarity 
between opposites (ASo) can be broken into two 
components which are logically and empirically 
independent of one another. Taken separately 
these two components sometimes correlate more 
highly with external variables than does ASo it- 


self. Implications of these qualities of ASo are 
discussed. 
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CASTRATION, CIRCUMCISION, AND ANTI-SEMITISM 
IRVING E. ALEXANDER anp SHELDON BLACKMAN! 


Princeton University 


T his writings, Freud gave recurrent expression 
to the view that fear of castration is a causal 
factor in anti-Semitism. In 1909 he wrote: 


eon es is the deepest unconscious 
boys hear that a Jew Ree ching = ff his p mt 

s at a ething cut off his penis— 

a piece of his penis, they think—and this gives them 
a right to despise Jews (4, p. 179). 
In later statements he pointed out that castration 
and circumcision are equivalent in the unconscious 
(5), and also that the strangeness of the custom 
for gentiles serves to activate unconscious castra- 
tion fear (6). That these views found some meas- 
ure of acceptance in psychoanalytic circles can be 
seen in the work of Fenichel (2). 

As a consequence of changing medical attitudes 
toward infant care, circumcision is no longer con- 
fined to one group in our culture. This turn of 
events provides an opportunity to gather some 
controlled information pertinent to Freud’s spec- 
ulation. The particular question we shall ask is 
whether circumcision is a relevant variable in 
anti-Semitism as measured in a contemporary 


male gentile population. 


METHOD 


The measure of anti-Semitism used in this study 
is the 52-item A-S scale described by Levinson and 
Sanford (8). This scale was chosen because it has been 
standardized on a college population and some evi- 
dence is available for its reliability and validity. A 
further point in its favor for the present purpose is 
that it includes a “threatening” scale consisting of ten 
items related to Jews as powerful and possible over- 
powering figures. In the case of all the test items, a 
score of seven indicates strong agreement with the 
anti-Semitic position, a score of one, strong disagree- 
ment. If, as happened rarely, an item was omitted or 
for other reasons could not be scored, a value of four 
was arbitrarily assigned. Appended to the test was a 
data sheet on which relevant background information 
was recorded. Included was a section on health which 
contained questions on surgery undergone, e.g. ap- 
pendectomy, tonsillectomy, circumcision, etc., and the 
time at which the operations had been performed. 

This scale was administered to 381 students drawn 
from various psychology courses at Princeton and 
Rutgers Universities.? The subjects (Ss) included in 
this study are 142 gentile male undergraduates selected 
from the larger population and divided into two 
matched groups of 71 each. The groups were matched 


1 Now at the Ohio State University. 
2 We are grateful to Mr. Edward Nolan of Rutgers 


University, Newark Branch, for assistance in gathering 
the sample. 


TABLE 1 
MEANS AND STANDARD DEVIATIONS OF SCORES ON THE 
A-S SCALE 
Group Mean SD 
Full Scale 
Circumcised 146.5 42.8 
Uncircumcised 151.8 43.4 
“Threatening” Scale 
Circumcised 26.4 7.5 
27.4 9.4 


Uncircumcised 


on the basis of background variables the relationship 
of which to anti-Semitism had been investigated in 
the literature (7). These variables include family in- 
come, geographical location of home town, religion, 
church attendance, and frequency and intimacy of 
contact with Jews. The sample is relatively homogene- 
ous with regard to other background variables such as 
age, rural-urban differences, and intelligence so that 
no effort other than inspection was made to control 
these factors. The one variable upon which the groups 
are known to differ systematically is whether or not 


the members are circumcised. 
RESULTS 

Means and standard deviations for each group 
on both the full scale and the “threatening” sub- 
scale are presented in Table 1. In both instances, 
on the full scale and on the threatening scale, the 
group mean for the uncircumcised Ss exceeded 
that of the circumcised. However, these differ- 
ences were small and statistically unreliable. An 
examination of the F ratios (full scale F = .43, 
threatening scale F = 1.01) gives no grounds for 
rejecting the view that they are attributable to 
chance.’ The results of this study do not support 
the hypothesis that circumcision is a relevant vari- 
able in determining attitudes towards Jews and 
in turn cast doubt upon Freud’s assumption as to 
the relation of castration anxiety to anti-Semitism. 


Discussion 


i Several considerations come to mind in assess- 
ing the bearing that this study might have on the 


3 The analysis of variance technique does not take 
into account the positive correlation introduced by 
the matching of the groups. No satisfactory technique 
exists for estimating the correlation when groups are 
matched on the basis of total composition. The magni- 
tude of F in each case is so small that the correlation 
would have to be above .9 in order to change the inter- 


pretation of our results. 
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castration—anti-Semitism hypothesis. One con- 
cerns the apparent evolution of attitudes toward 
this practice in the last twenty years. If few people 
in the culture at the present time, circumcised or 
not, have negative feelings about infantile circum- 
cision, one might argue that this change has ob- 
scured the validity of Freud’s original contention. 
The custom is no longer strange, and therefore 
ought not to arouse castration fear. This argument 
we would grant for children born in the present 
and future decades, but not for children born dur- 
ing the early stages of cultural change in attitude. 
During our Ss’ formative years, both positive and 
negative attitudes toward circumcision were wide- 
spread and accepted in the larger culture. 
Another consideration is whether we were deal- 
ing with a subculture so demanding of conformity 
from its members (the sample was predominantly 
composed of Princeton students) that it tended to 
obscure any real underlying attitudinal differ- 
ences. That is to say, perhaps we were getting a 
reflection of a Princeton stereotype towards Jews 
rather than a true indication of the S’s feelings. 
The fact that the scores of Jewish and gentile 
students in the entire Princeton sample differ sig- 
nificantly (p = .05) suggests that the demands 
of the subculture leave room for expression of at- 
titudinal differences. 
Recently, two other investigators have pre- 
sented discussions on the relationship of circum- 
cision to castration which bear indirectly on the 
question we are considering. Bettelheim (1) and 
Fraenkel (3) have both questioned the widespread 
psychoanalytic belief that circumcision as a ritual 
stems from or has the meaning of a castrating act. 
Bettelheim, after a study of some anthropological 
and clinical material, finds no general evidence to 
support the idea that the practice arose in order 
to maintain an incest taboo or even that it evolved 
from father dominated societies. The Jewish prac- 
tice, a relatively late one in history, he asserts is 
atypical in that it is carried out in infancy and is 
restricted entirely to males. The meaning of the 
circumcision rites, according to Bettelheim, is 
more frequently related to fertility and to the 
individual fulfillment of an unconscious bisexual 
wish. He offers no support for the assumption that 
circumcision and castration are necessarily sym- 
bolically equivalent or that circumcision would 
inevitably have negative valence in the uncon- 
scious. 
Fraenkel concluded from a study of ancient 
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Jewish religious treatises, particularly the Penta- 
teuch, that circumcision appeared to be more 
related to the benediction of fecundity than to 
an attenuated castration inflicted by the chief on 
his son. Only after circumcision was Abraham 
able to beget Isaac. Fraenkel characterizes the 
promise of the ritual by a quotation from the book 
of Genesis, “His seed will be as numerous as the 
sands of the earth.” If the circumcision-castration 
argument cannot therefore be invoked with con- 
fidence, then Freud’s speculation with regard to 
anti-Semitism loses its raison d’être. 


SUMMARY 


To obtain data relevant to Freud’s speculations 
on the relationship between circumcision and anti- 
Semitism, attitudes towards Jews were measured 
among matched groups of circumcised and uncir- 
cumcised male gentile college students. No reliable 
differences in attitude were found between cir- 
cumcised and uncircumcised subjects. These find- 
ings indicate the relative unimportance of circum- 
cision as an isolated factor in anti-Semitism. The 
implications for the relationships involving circum- 
cision, castration, and anti-Semitism are discussed. 
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CONSISTENCY IN RESPONSE TO GROUP PRESSURES! 


STANLEY ROSNER 
Society for Mental Hygiene, Bridgeport, Connecticut 


TZ studies to be described take as their 
point of departure the investigations of 
Asch (1, 2) on modification of judgments under 
group pressure in face-to-face situations. Asch has 
reported that naive subjects (Ss) yielded approxi- 
mately one-third of the time to the erroneous 
judgments of a tutored majority in matching the 
length of comparison lines with a standard line. 
The Ss who yielded early in the line series (con- 
sisting of 12 critical trials) tended to continue to 
yield throughout, while Ss who were independent 
of the announced judgments of the tutored ma- 
jority, tended to continue independent throughout. 

The purpose of the present studies is to answer 
the question: Are the consistencies in yielding or 
not yielding specific to the situation studied, i.e., 
judgment of lines, or do they extend beyond it? 
Is such consistency episodic or situationally deter- 
mined, or is it a consequence of relatively settled 


modes of orientation? 


METHOD 


To answer the above question, 88 student nurses? 
were observed in each of several Asch-type situations, 
in each of which they had to make a choice between 
being independent or conforming to the judgments of 
three accomplices? who were tutored by the experi- 
menter to make erroneous judgments. 


Tasks! 

Three memory tasks (nonsense words, nonsense 
figures, and a simple paragraph), a questionnaire, and 
the line series used by Asch (2) were employed in the 


experiment. Necessary controls for order of presenta- 
tion and equating of control and experimental series 


were observed. 


of a dissertation submitted in 

e requirements for the degree of 
Doctor of Philosophy to the Graduate Faculty of 
Political and Social Science, The New School for Social 
Research, New York City. The writer owes a special 
debt of gratitude to Dr. Mary Henle for her encour- 
agement of the research and for her critical reading of 
the manuscript. 

2 Thanks are owed to Miss Evelyn Lehrer, Assist- 
irector (formerly) of the Westchester School of 
Nursing and to Miss Barbara Tate, Associate Director 
of St. Lukes Hospital School of Nursing for helping 
to make their respective student bodies available. 

3 Asch (2) has shown that the effect of a majority 
of three on the judgments of naive Ss is as great as 
that of larger groups. 

4 For details, the rı 
tion (3). 


1 This study is part 
partial fulfillment of th 


ant D 


eader is referred to the disserta- 


RESULTS 


1. Where comparisons are possible, present 
findings confirm those of Asch (1, 2). 

2. A high degree of consistency in response to 
group pressure was found within a given task at 
a single sitting; Asch’s finding that Ss who were 
independent early in the experimental session con- 
tinued independent and Ss who yielded early in 
the experimental session continued to yield is 
substantiated with the present population. 

3. Response to group pressure was significantly 
consistent between tasks at a single sitting. The 
Ss who were low yielders on one task tended to 
be low yielders on other tasks; high yielders on 
one task tended to be high yielders on other tasks. 
Phi coefficients ranging from .33 to .54 were ob- 
tained between pairs of tasks with respect to high 
and low yielders. 

4. Response to group pressure was significantly 
consistent over a period of time. The Ss who 
yielded or did not yield during one sitting tended 
to yield or not to yield, respectively, during later 
sittings as shown in Table 1. 

5. The embedded figures test (4, 5) was ad- 
ministered to 20 high and 20 low yielders to ex- 
plore the relationship between this test and the 
group pressure situation. No significant difference 
was found between the two groups in the amount 
of time they required to locate the simple in the 


TABLE 1 


CONSISTENCY OF YIELDING BEHAVIOR OVER A TIME 
LAPSE 


No. of Judgments Showing Yielding 


, Lines Lines 
Subjects Session I* Session II 

Yielders 

1 12 {1 

2 8 5 

3 8 12 

4 11 1 

5 9 12 
Total 48 mi 
Nonyielders 

1 0 1 

2 0 0 

3 2 1 

4 2 0 

5 0 1 
Total 4 3 


* Procedure identical to that described by Asch (2). 
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complex figures, though the existing differences 
were in the expected direction. The yielders failed 
to locate the simple in the complex figures sig- 
nificantly more often than the nonyielders. 
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ON THE RELATIONS BETWEEN DISCRIMINATION REACTION 
TIME, ANXIETY, AND INTELLIGENCE 


G. ROBERT GRICE 
University of Illinois 


N A test of a theory concerning the effects of 
I drive on performance involving conflicting 
habits, Grice (4) found the predicted difference 
on a complex discrimination reaction time task 
between two groups differing in “manifest anx- 
iety.” However, a covariance analysis indicated 
that this difference could be attributed to differ- 
ences between the groups in intellectual ability. 
Freeman and Maher (3) have criticized Grice’s 
conclusion on the basis of speculations that the 
Discrimination Reaction Time Test (DRT) is 
“largely perceptual in character” and that per- 
ceptual factors in the Clerical Aptitude Index 
(employed as an intellectual measure) may have 
produced the obtained relation rather than gen- 
eral intellectual ability. Citing the excellent 
study by Fleishman and Hempel (2) as evidence, 
they argue that intellectual components in DRT 
tend to disappear with practice and that percep- 
tual speed and simple reaction time assume greater 
importance. 

In the first place, it should be pointed out that 
the task analyzed and described by Grice was 
quite different from that employed by Fleishman 
and Hempel. Instead of a long period of practice 
with a single set of instructions, the task (Grice’s 
Problem 5) involved a signal, one second in ad- 
vance of each reaction, which informed S which of 
four rather complicated sets of instructions were 
to be employed in determining the correct re- 
sponse. It would appear likely that this involved 
more intellectual ability than even the early trials 
of the conventional DRT task. In the second 

place, it should be pointed out that “Perceptual 

Speed” was the only purely perceptual factor 
. d in the Fleishman and Hempel 

which emerge A 

analysis and that it accounted for only five per 

cent of the DRT variance even after long practice. 


This cannot be regarded as strong evidence that 
the test was “largely perceptual in character.” 
The other point raised by Freeman and Maher 
concerned the suitability of the Clerical Aptitude 
Index as a general intellectual measure and the 
question of the presence of perceptual factors in 
it. They ask, rhetorically, “Were not Air Force 
data on tests more universally recognized as gen- 
eral intellectual measures available?” The answer 
is simple. No. The Clerical Aptitude Index is 
actually a rather good measure of intellectual 
ability as Grice originally stated. It is based ‘on 
four tests: Word Knowledge, a test of academic 
vocabulary; Background for Current Affairs, 4 
test of general information; Numerical Opera- 
tions, a test involving simple computations; ri 
Dial and Table Reading, a test requiring the 
verification of readings on groups of dials and the 
determination of certain information by reading 
mathematical tables. It was this last test which 
lead to the statement by Grice that the index con- 
tained perceptual factors. However, perceptua 
ability is by no means its major source of variance. 
In a factor analysis by Zachert (5), the test hae 
a loading of .33 on a factor named Perceptual 
Speed. It also had loadings of 50 on Numerica 
Facility, .39 on General Reasoning, and .29 0M 
Verbal Comprehension. In a more recent analysis 
by Bechtoldt (1), it had a loading of .17 on z 
Space factor and loadings of .31 and .26 on Acà 
demic Background and Induction respectively. 
The other tests show their major loadings 0? 
factors named Verbal Comprehension, Number, 
and Academic Background (1). All four tests have 
their major projections on second-order factors 
labeled by Bechtoldt as “G” (General Intellec- 
tive) and “VM” (Facility with Verbal Materials). 
With the nature of the tests in mind, it can now 
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TABLE 1 
PERFORMANCE OF HIGH AND Low Anxious GROUPS 
on Eacu TEST 


n High A| LowA 
Test t $ 
Mean | Mean 
Word Knowledge 4.40 | 6.03 |3.47|<.01 
Numerical Operations 4.40 | 5.37 |2.05|<.05 
Dial & Table Reading 3.37 | 4.70 |2.51|< .02 
Bkgd. Cur. Aff. 4.13 | 5.40 |2.61/<.01 


be pointed out that in Grice’s study, each of the 
four significantly differentiated between the high 
and low anxious groups and that each was also 
positively correlated with response speed in the 
reaction time task. The means of the high and low 
anxious groups and the values of ¢ are presented 
in Table 1. The product-moment correlations of 
Tesponse speed in Problem 5 in the DRT task 
with the individual tests were as follows: with 
Word Knowledge, .50; with Numerical Opera- 
tions, .42; with Dial and Table Reading, .63; and 
with Background for Current Affairs, .33. 

These considerations strongly suggest that the 
relationships reported by Grice were due to intel- 
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lectual variance in common with “manifest anx- 
iety,” “clerical aptitude,” and the reaction time 
task, and that the speculations of Freeman and 
Maher are contrary to fact. 
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WARMTH IN OTHERS 


JOSEPH BOSSOM AND A. H. MASLOW 
Brandeis University 


HE effect of variation among the people 
being judged has been considered in pre- 


f first impressions (2). The present 
d with the effect of variation 
among judges upon their first impressions of 
others. Differences in the first impressions of se- 


cure and insecure judges are investigated. 


vious studies 0! 
study is concerne 


METHOD 


Subjects 


Judges were 105 undergraduate students enrolled 


in ‘an elementary course in psychology. From this 
total group were selected 22 judges who made scores 
of 9 points or Jess on the Maslow S-1 test for security 
(1), the secure group. Their scores ranged from 0.89 
to 1.62 SD below the mean of the total group. Twenty- 
two judges, who scored 30 points or more comprised 
the insecure group. Their scores ranged from 0.71 to 
2.65 SD above the total group mean. 


Stimulus 
The “objects” to be judged consisted of the first 


200 photographs obtained from yearbooks of former 
graduating classes. Except for difference in age (about 
two years), the “objects” and the judges come from 
about the same socioeconomic, racial, and religious 


background. 


Exposure to Stimulus 


Each judge was handed an answer sheet with line 
space for 200 responses. The categories of response 
were headed: Very Warm; Warm; Cold; Very Cold. 
All judges were given the following verbal instructions: 

You will be shown 200 photographs. They will 
appear on the screen in front of the room, These 
photographs will be changed rather rapidly. After 
looking at a photograph you will have sufficient 
time to indicate your impression of the person in 
the photograph. This is done by placing a mark in 
the appropriate box on your answer sheet. 

Subsequently, the 200 photographs were projected 
on a screen. Each photograph was exposed for about 
5 seconds. Periodically, the number of the photograph 
was called out so that judges could determine whether 
they had missed an item. 


RESULTS AND DISCUSSION 


Judges in all groups were more likely to ascribe 
warmth than coldness to the persons represented 
in the photographs. The secure group, the insecure 
group, and the remaining judges (others) were 
compared with respect to the proportion in each 
group who ascribed warmth more frequently than 
coldness. As can be seen in Table 1, the secure 
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TABLE 1 


PERCENTAGE OF JUDGES IN EACH GROUP ASCRIBING 
WARMTH More OFTEN THAN 
COLDNESS TO PHOTOGRAPHS 


Group 
ate- k a Probability of 
gori Compared | Se; | Inse- | Oth Diference 
W =| W =| WN = 
22) | 22) | 61) 

95 68% | 78% | Sec. os. Others 10 
Fais Gold % Insec. vs. Others .50 
Sec. vs. Insec. 05 
V. Warm & Warm | 90% | 54% | 78% | Sec. vs. Others 80 
vs. V. Cold & Insec. vs. Others 05 
Cold Sec. vs. Insec. 02 
V. Cold vs. V. | 77% | 68% | 42% | Sec. vs. Others 01 
Warm Insec. vs. Others 05 
Sec. os. Insec. +70 


and insecure groups differed significantly (p = 
.05) in the number of judges scoring Warm as 
compared with Cold. The difference between these 
groups was also significant when the Very Warm 
and Warm categories taken together were com- 
pared with Cold and Very Cold (6 = .02). In 
these comparisons a greater percentage of judges 
gave Warm responses in the secure group than 
among the “others.” The “others,” in turn, had a 
greater percentage of judges scoring warmth than 
the insecure group. When only the Very Warm 
and Very Cold categories were considered, sig- 
nificant differences were not obtained between 
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the secure and insecure groups. In addition, a 
higher percentage of judges in both extreme 
groups employ the Very Warm category than 
among the intermediate group of “others.” 

Since our data do not show whether the faces 
in the photographs are in “reality” warm or cold, 
we cannot attribute the obtained group differe- 
ences to differential efficiency of perception of 
reality, projections based upon habitual expecta- 
tions in interpersonal relationships, or to Freudian 
projection. Decision among these alternatives 
must await further research. 


SUMMARY 


From a larger group of judges, a secure and an 
insecure group were selected. All judges were ex- 
posed to a series of 200 photographs. Judges were 
required to indicate whether the person in the 
photograph was Very Warm, Warm, Cold, or 
Very Cold. The percentage of judges reporting 
Warm impressions more often than Cold impres- 
sions tended to be greater in the secure group than 
in the insecure group. 
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PREDICTING THE DIRECTION OF SELECTIVE RECALL: 
ITS RELATION TO EGO STRENGTH AND N ACHIEVEMENT! 


THELMA G. ALPER 
Wellesley College 


HEN the research findings in a given 
area continue over the course of 
many years to be as diverse as those 


on the recall of interrupted vs. completed tasks, 
the challenge to find out why can be strong. 
This has been the motivation behind the series 
of four papers by the writer (1, 2, 3, 4), the 
first of which appeared in 1946. At the time the 
research for the first two papers was underta- 
ken, several studies had appeared. All of them 
had been designed to test further the conditions 
under which the Zeigarnik (29) finding, that in- 
terrupted tasks are more likely to be recalled 
than completed ones, would hold. Many of 
these studies, as previous reviews of the 
literature have shown (e.g, 1, 3, 13, 26), 
seemed not to support Zeigarnik since they 
did not yield statistically significant recall 
differences in the expected direction under 
conditions where completion and incompletion 
of tasks was experimentally controlled. One 
experiment, in fact, (24) indicated a reversal 
in the recall direction, i.e., a greater recall of 
completed than of interrupted tasks. The 
subjects (Ss) who served in that experiment, 
a group of crippled children, had been told 
“that they were to be given a test to de- 
termine how well they could do puzzles and 
that a prize would be given the one who did 
best” (24, p. 249). Not all of the Ss behaved 
in the same way. Some showed the reversal, 
some did not, others recalled an equal number 
of completed and interrupted tasks. The 
reversal pattern, however, was characteristic 
mainly of Ss rated by their teachers to be 
high on “pride.” Though the ratings were 
admittedly not highly reliable, this was one of 
the first studies to support Zeigarnik’s sugges- 
tion that personality factors may play a part 
in selective recall (29, p. 303). Zeigarnik’s 
finding here with adults was that “ambitious” 
subjects, identified by remarks such as “What 
is the matter with me—!” or “I wouldn’t 
have believed I could be so stupid,” recalled 


1The writer acknowledges with thanks the invalu- 
able help of Henry A. Murray, Robert W. White, and 
and Eugenia Hanfmann. 


even more interrupted than completed tasks 
(29, p. 307). 

The importance of another variable is also 
suggested by the Rosenzweig and Mason 
study; namely, the orientation of the experi- 
mental instructions. Working for a prize 
which you may get by doing well on a test is 
probably dynamically not the same as working 
on some tasks because someone asks you to 
do so.? Zeigarnik’s instructions had been “I 
shall give you a series of tasks which you are 
to complete as rapidly and correctly as pos- 
sible” (29, p. 300). In a later experiment, 
Rosenzweig (23) tested the orientation vari- 
able. He asked one group of college males, 
the “informal” group, to help the experimenter 
(E) classify some jigsaw puzzles in preparation 
for their use in a later experiment. Other Ss, 
the “formal” group, were told that the puzzles 
were part of an intelligence test. The selective 
recall results of the two groups were found to 
differ: the informal group recalled significantly 
more interrupted tasks, the formal group 
significantly more completed tasks. In Zeigar- 
nik’s study, no attempt had been made to 
maintain a uniform experimental atmosphere. 
On the contrary, Zeigarnik tells us that in 
keeping with differences in the Ss’ reactions 
to the situation, wanting “to please the £,” 
or “to excel as if in competition with others,” 
or “interested in the task for its own sake” 
(29, p. 303), the Æ “did not preserve a fixed 
mien and method with all subjects. Those of 
the first type were allowed to see the experi- 
menter’s pleasure when the task was well 
done. Work by the second group was inspected 
with the air of an examiner, while the third 
group was allowed to work unmolested, the 
experimenter in this case remaining passive” 
(29, p. 303). 

In the meantime, Alper (1), working also 
within Zeigarnik’s tension theory of selective 
recall, and taking up Zeigarnik’s (29, footnote 
1, p. 312) suggestion that the recall of a given 

2 See Alper (3), for a further discussion of this point 


with special reference to the sample used in the 
Rosenzweig and Mason (24) study. 
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S might differ in different experimental set- 
tings, proposed that a more sensitive way to 
test for the conditions that influence the 
direction of recall might be to measure the 
recall of the same Ss under two different ex- 
perimental settings: task orientation in one 
session, ego orientation in the other. To create 
a task-oriented atmosphere, the instructions 
emphasized, as Rosenzweig (23) had done, that 
E was testing the materials, not the S. For 
the ego-oriented atmosphere, where the in- 
tention was to threaten self-esteem, Ss were 
told that this was a standardized set of tasks 
and that they constituted a brief intelligence 
test devised for the army to use in selecting 
candidates for officers’ training school. The Ss 
were all draft age college males. As predicted, 
Ss differed in their reactions to both instruc- 
tions, and there were no significant recall 
differences under either orientation. When the 
recall results were correlated with personality 
ratings derived from an intensive clinical 
study of these Ss within the framework of 
Murray’s (21) need theory of personality, 
however, two major patterns of recall emerged: 
(a) the Strong Ego pattern, consisting of the 
greater recall of interrupted tasks under task- 
oriented conditions, and of completed tasks 
under ego orientation; and (b) the Weak Ego 
pattern, consisting of a greater recall of com- 
pleted tasks under task-oriented conditions, 
and of interrupted tasks under ego orientation. 
One other variable had been controlled in 
the above study—the nature of the tasks 
used. Alper and Black (4) have shown that 
most of the tasks used by Zeigarnik (29), 
Lewis (15), and others, would be unacceptable 
to college students as IQ tests. Or, to put it 
concretely, failure to complete within a given 
time limit tasks such as bead-stringing, wind- 
ing thread on a spool, circling vowels, etc., 
would not be regarded by an S as a reflection 
on his intelligence, though failure to complete 
an anagram or a scrambled phrases task 
would. The use of tasks not commensurate 
with the atmosphere E intends by his in- 
structions may account for some of the pre- 
viously equivocal findings in this field, as the 
writer has pointed out elsewhere (1, 4). In 
the experiments referred to above, the writer 
had used a scrambled phrases task. 
To test the validity of the Strong and Weak 
Ego recall patterns, the writer designed another 
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experiment. Now two groups of Ss, a Strong 
Ego and a Weak Ego group, respectively, were 
selected in advance by means of specially 
constructed instruments. From the direction 
of their total recall patterns over two sessions, 
task-oriented first, and later an ego-oriented 
one, it was possible to predict correctly the 
group membership of these Ss as Strong or as 
Weak Egos significantly beyond chance. This 
experiment has been cited briefly elsewhere 
(2), but has not been reported in full. 

In 1952, reacting to Sears’ (26) recommenda- 
tion that the interruption technique be 
abandoned as unsatisfactory since so little 
agreement had been reached by the various 
Es, the writer (3) tried to show: (a) that both 
orientation and personality were important in 
selective recall; and (b) that when both were 
considered, the seemingly diverse results of 
Zeigarnik (29), Rosenzweig and Mason (24), 
Rosenzweig (23), Glixman (13), Sanford (25), 
and others could be fitted into a common 
theoretical framework. The writer had in- 
tended to let the matter rest here. Recently, 
however, studies by Atkinson (5), and Atkin- 
son and Raphelson (6) have renewed her 
interest since the data reported by these 
authors put a different interpretation on the 
results of earlier studies. They report that 
high Achievers recall more interrupted than 
completed tasks under Achievement Orienta- 
tion while low 2 Achievers show a reversed 
recall pattern. While these are the same 
recall patterns the writer has discussed, the 
personality relationships involved are not the 
same and the interpretations given them are 
also not the same, In fact, on the surface, at 
least, the Achievement and Ego Strength 
frameworks would seem to have yielded dy- 
namically opposite results. 

Since 2 Achievement had not been measured 
in the writer’s earlier studies, to test the 
relationship between m Achievement and Ego 
Strength requires a new experiment. This 
experiment has now been done and is reporte 
in Part II of the present paper. In Part I, 
the writer has gone back to the prediction 
study mentioned above and is reporting it 
now in full since it confirms the validity of 
the Strong and Weak Ego recall patterns 
statistically revealed by the syndrome analysis 
used in the earlier study. It provides us; 
therefore, with a firmer base for testing the 
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relationship between 7# Achievement and Ego 
Strength and for applying the results to 
other selective recall studies. 


Part I 


PREDICTING THE DIRECTION OF RECALL 
OF STRONG AND WEAK Eco SUBJECTS 


In the writer’s earlier papers on the recall 
of interrupted vs. completed tasks (1, 2), the 
major premise had been that the direction of 
the recall ratio is a function of the personality 
structure of the individual recaller. It had 
been hypothesized “that the direction of 
selective recall of a given subject will differ in 
a non-self-esteem involving laboratory situa- 
tion and in a laboratory situation where self- 
esteem is objectively threatened in a manner 
which is consistent with the self-esteem needs 
of that subject” (2, pp. 104-105). In designing 
the experiment to test this hypothesis, there- 
fore, both instructional set and personality 
structure were taken into account. 

Ten male undergraduates had served as Ss. 
Each was tested first in a task-oriented and 
later in an ego-oriented setting. The intention 
had been to threaten self-esteem objectively 
only in the latter setting. Since personality 
ratings, based on intensive clinical studies of 
these Ss within the framework of Murray’s 
(21) need theory of personality were available, 
selective recall could be correlated with per- 
sonality structure. Horn’s (14) _ syndrome 
analysis technique was used for this purpose. 

The analysis yielded two major selective 
recall patterns. One pattern, made up of the 
recall of more interrupted than completed 
tasks under Task Orientation and of more 
than interrupted tasks under Ego 
Orientation, was labeled the “Strong Ego” 

he syndrome analysis showed it 


pattern since tl 
to be characteristic only of Ss who had been 
rated high on ” Recognition,’ 7 Dominance,‘ 


Ego Strength,® and Conative Conjunctivity,® 


completed 


achieve something in 
fame; to seek prestige 
ard; to boast of one’s 


34 Recognition: to seek to 
order to win acclaim, renown, 
and glory; to put oneself forw: 
accomplishments. 


4% Dominance: to seek to control the sentiments 


and behavior of others by suggestion, seduction, per- 
suasion or command; to lead others; to organize and 
direct groups of people; to get others to cooperate. 

5 Ego Strength: to know what one wants to do and 
has the capacity to do, and to do it. It is a matter of 
willing one’s desires within the limits of one’s abilities, 
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and low on Dejection Pessimism’ and Ego- 
Ideal Intragression.’ The opposite recall pat- 
tern, consisting of the recall of more completed 
than interrupted tasks under Ego Orientation, 
was labeled “Weak Ego” since it was charac- 
teristic only of Ss who had been rated high on 
Dejection-Pessimism and Ego-Ideal Intra- 
gression, and low on 7 Recognition,  Defend- 
ance,’ 2 Counteractive Achievement,” and 


Narcism.” 
In order to test whether these statistically 


derived patterns could be experimentally re- 
produced, the prediction study described here 
was designed. 


Method 

Two groups of nine Ss each, a Strong Ego and a 
Weak Ego group respectively, were selected in ad- 
vance of the experiment by means of specially con- 
structed clinical tools. All Ss were tested under Task 
Orientation first and later under Ego Orientation. 

It was predicted that the recall patterns of these 
Ss would be consistent with the findings of the earlier 
experiment since, apart from the fact that personality 
structure now served as an independent variable, the 
design of the prediction study followed closely the plan 
of the previous experiment. 

Selection of subjects. Two procedures were used to 
select Ss. First, a self-rating scale, The Psychological 
Insight Test, was constructed to measure the personality 
variables in the Strong and Weak Ego syndromes. 
Second, Ss whose self-ratings identified them as Strong 
or as Weak Egos were interviewed. Only Ss whose 


existing or acquirable. This manifests itself chiefly as a 
successful » Achievement, giving proof of the power to 
persist; an unconquerable will. 

6 Conative Conjunctivity: to organize one’s efforts; 
to make plans and to follow them; to force drives into 
an efficient pattern; to live an ordered life. This vari- 
able gives evidence of the ‘coordinating powers of the 
ego.” 
z 7 Dejection Pessimism: to worry about one’s short- 
comings; to be pessimistic about life; to recover slowly 
from disappointment. 

8 Ego-Ideal Intragression: to feel humiliated and 
ashamed after failure; to be burdened by feelings of 
inferiority. The ego is belittled and criticized for 
stupidity after failure. 

®n Defendance: to defend oneself against blame, 
belittlement, assault; to justify one’s actions; to offer 
extenuations, explanations and excuses for oneself. 

10% Counteractive Achievement: to set difficult 
goals for oneself and attempt to reach them. To stick 
at a task until satisfied with the results; to overcome 
obstacles; to restrive after failure; to redouble one’s 
efforts in the face of frustration. 

1 Narcism: to perceive the world from a 
subjective view-point; to be disdainful of o 
dominated by ruthless self-seeking; to pw 
indifferent to one’s failures and to one’s critics; 
self-satisfied; to be eccentric. 


personal or 
thers; to be 
rport to be 
to be 
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TABLE 1 
‘THE PsyCHOLOGICAL INSIGHT TEST 


A Sample of the First Twelve Items 


1.1 enjoy acting before a large audience. (n Recog- 
on 
9). J preier to rely on my own, judgments rather than 
depend on other men’s opinions. (Narcism) 
3. I look upon failure as a challenge to be met by 
redoubled efforts to succeed. (n Counteractive 
ievement, N A 
4. A at conscious of my main goals in life. 
(Ego Strength) | g g 
5. I usually can think of something to say in my own 
defense. (n Defendance) i 
6. I love good music and good wine and have become 
quite a connoisseur in such matters. 
7. I usually decide for myself what I want to do and 
do it. (Conative Conjunctivity) 
8. I am more apt to give in to an, opponent than tọ 
keep on arguing with him. (Dejection Pessimism) 
9, I spend a lot of time thinking about my short- 
comings. (Ego-Ideal Intragression) wae 
10. I enjoy organizing or directing the activities of a 
group—team, club, or committee. (n Dominance) 
11. I like to amuse people by wisecracking or playing 
the fool. (2 Recognition) 
12. I feel that I am temperamentally different from 
most people. (Narcism) 


self-ratings were confirmed by the clinical interviewer 
were asked to serve in the experiment. 

The Psychological Insight Test. This test, consisting 
of two hundred statements, was constructed by the 
writer according to principles outlined by Murray and 
his co-workers (21). It provides a measure of the six 
personality variables within the Strong Ego syndrome 
and the six within the Weak Ego. Since three of the 
variables, » Recognition, Dejection-Pessimism, and 
Ego-Ideal Intragression, appear in both syndromes, 
only nine variables are included in the test. Each is 
measured by a set of 20 statements. Twenty additional 
statements which tap a variety of disparate needs were 
included but not scored. (See Table 1%.) 

The test, self-administered, requires S to rate him- 
self on each statement on a +3 to —3 scale, the scale’s 
zero point being the behavioral and emotional reactions 
of most men of S’s age, i.e., “the hypothetical average 
among college men.” Thus, +3 represents very much 
more often than the average college man, —3 very much 

less often, +1 average but on the high side, — 1 average 
but on the low side, etc. 


12 These statements were added both to disguise the 
nature of the test and to facilitate scoring. Nine vari- 
ables and a “joker” variable repeated 20 times each 
allowed for the following scoring arrangement: state- 
ments measuring a given variable, for example z Recog- 
nition, were placed in the first, eleventh, twenty-first 
etc., positions; a second variable, Narcism, in the 
second, twelfth, twenty-second, etc., positions Since 
the statements for a given variable are sufficient! 
diversified in the test the scheme is not readily ae: 
tected by S. This plan is demonstrated by the samy le 
set of statements in Table 1. The need which ak 


measured has been noted in parenth i 
eses foll 
statement. PEE DA 
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When scoring the test, Strong Ego and Weak Ego 
scores are computed separately for each S. Each score 
consists of two parts: the algebraic sum of those vari- 
ables which should be high for the given pattern, and 
the algebraic sum of those variables which should be 
low. A Strong Ego’s score on Strong Ego variables, 
thus, would yield a +, — pattern, a Weak Ego’s score 
on Weak Ego variables, a —, + pattern. Arithmetically, 
the scores on the Strong Ego variables can range from 
+240, —120 to —240, +120. The same is true for the 
Weak Ego variables.” 

The Psychological Insight Test was administered to 
140 male undergraduates enrolled in an introductory 
course in psychology at Harvard University under 
“take home” conditions. Eighty-six tests were Te- 
turned. A Strong and a Weak Ego score was computed 
for each S, following the procedure described above. 

In order to determine the range of Strong and of 
Weak Ego scores obtained by this sample, the bipart 
scores were combined into a single score by changing 
the sign of the second half of the score and adding the 
resulting sum to the first half of the score. For example, 
a score of +50, —25 would become +75, a score of 
—50, +25 would become —75. The scores of the 
eighty-six Ss on the Strong Ego variables ranged from 
+174 to —174; on the Weak Ego variables from +152 
to —112. The Ss whose scores fell in the upper quartile 
of the Strong Ego distribution were selected as poten- 
tial Strong Ego Ss. The Ss whose scores fell in the 
lower quartile of the Weak Ego distribution were selec- 
ted as potential Weak Ego Ss. Of these, twelve Strong 
Ego scoring Ss and eleven Weak Ego scoring Ss agreed 
to come for an interview to the Harvard Psychological 
Clinic for the purpose of “possible participation in some 
experiments at the Clinic.” 

The interview. The interview, conducted cither by 
Dr. Henry A. Murray or by Dr. Robert w. White, 
was structured around the Strong and Weak Ego syn” 
drome variables, definitions and criteria for which were 
provided to the interviewer. Using a 0-5 point scale, 
the interviewer rated S on eight of the nine variables 
in these syndromes! and on the basis of these ratings 
labeled Sasa “Strong Ego”, a “Weak Ego” or “neither 
consistently Strong nor consistently Weak.” 

__ If the self-ratings and the interviewer's ratings Wer’ 
in agreement, either consistently “Strong” OY con- 
sistently “Weak,” S was asked by the interviewer to 
serve in the experiment. In this way, the two groups 


*%To obtain the maximum Strong Ego score, S 
would have to rate himself +3 on all statements (80) 
pertaining to » Recognition, » Dominance, Ego 
Strength, and Conative Conjunctivity, and —3 on all 
statements (40) pertaining to Dejection-Pessimism an 
Ego-Ideal Intragression. It is to be noted that unlike 
the McClelland ef al. (16) n Achievement variable, the 
present syndromes do not represent opposite ends 0 
the same personality continuum. Only some of the 
variables appear in both syndromes. 

“Through an oversight on the part of the writer, 
one variable, Narcism, had been omitted from the 
interview schedule. 

15 A partial measure of the sensitivity of The Psy- 
chological Insight Test is reflected by the fact that only 
five of the 23 Ss interviewed had to be rejected for the 
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of nine Ss each were selected. Their group membership 
“Strong” or “Weak” was not known to the writer 
until after the prediction scores had been computed 
at the end of the experiment. g 

Materials. The experimental materials were the same 
as those used in the earlier experiment (1). They con- 
sisted of two comparable sets of scrambled phrases to 
be arranged into meaningful sentences. Of the 12 
sentences in each set, six were readily solvable within 
the two-minute time limit, three too difficult for con- 
trol Ss to solve within the time limit, and three were 
unsolvable. Each solvable sentence permitted four 
alternative solutions. 

Procedure. The procedure of the earlier experiment 
(1) was followed exactly. The atmosphere and instruc- 
tions in Session I, Task Orientation, were informal and 
objectively non-self-esteem threatening since ostensibly 
S was merely helping Z find out whether the materials 
would be suitable for a later experiment. The atmos- 
phere and instructions in Session I, Ego Orientation, 
a week later, were formal and objectively threatening 
to self-esteem, sentence solution now being presented 
as “a brief intelligence test which the Clinic had devised 
for use by the Army in selecting officer training school 
candidates.” To heighten the threat, moreover, & male 
accomplice, introduced as another S, worked in the 
same room on the same sentences and “successfully 
completed” all tasks within the time limits allowed. 

At the end of the session, following a ten-minute 
interpolated projective test task,!° incidental recall of 
the sentences was measured. 

Treatment of recall data. In order to adjust for indi- 
vidual differences in number of sentences actually com- 
pleted, two percentage recall scores were computed for 
each S: percentage of completed tasks recalled, and 
percentage of interrupted tasks recalled. This was done 
separately for each session. Since percentage scores are 
nonlinear, each was converted into angles by the arc 
sin/ percentage transformation (27, Table 16.8, p- 
449) before applying tests of significance. 

Prediction of Whether a given S fitted the Strong 
or the Weak category was based on his total recall 
score. The formula used for obtaining S’s total recall 


was as follows: 
Sota Recall Score = (Int — Cy) — (inn — Cu) 
where Inr refers to S’s transformed score for the recall 
of interrupted tasks in Session I, Cr to his transformed 
score for the recall of completed tasks in. Session I, 
Inz to his transformed score for the recall of interrupted 
tasks in Session II, and Cir to his transformed score 
for the recall of completed tasks in Session II. 

To support the earlier results (2), the total recall 
score of a Strong Ego should be positive, & Weak Ego 
negative. ‘The correctness of the prediction was tested 
by means of Fisher’s (11) exact test. 


To test whether the experimental groups differed 
in the direction of their recall within each orientation, 


ored “Strong” on the self-rating 
“Weak” by the interviewer and 
stently fit the interview criteria 


experiment. One S sc 
scale and was rated 
four Ss did not consi: 
for either “Strong” or “Weak.” 

16 The results of this test, The Mind Reading Test, 


are not included in this report. 


TABLE 2 


PREDICTION OF STRONG AND WEAK Eco, 
SELECTIVE RECALL PATTERNS FOR Two 
Groups OF MALE UNDERGRADUATES, 
A StronG Eco GROUP AND A 
Weak Eco Group 


Strong Ego Weak Ego 
Category Recall Recall ip* 
Pattern® Pattern? 
Expected 9 9 
Obtained 9 8 
Correctly 7 6 045 
Predicted 


a Based on total recall pattern over two sessions, (Int — Cr) — 
(Inir — C11); to be counted as a Strong Ego pattern the score 


must be positive. 
b Based on total recall pattern over two sessions, (Int — C1) — 


(Inr — Cır); to be counted as a Weak Ego pattern the score 


must be negative. 
* Computed by means of Fisher’s (11) exact test. 


the Mann-Whitney U Test (16) was applied to the 
transformed percentage recall scores. 

Treatment of performance data. Differences between 
the two groups of Ss in number of sentences solved, in 
number of alternative solutions per sentence achieved, 
in time to achieve a first solution, and in total number 
of sentences recalled were analyzed by means of 
analysis of variance. 


Results 

Prediction of group membership. Nine of the 
18 Ss had positive total recall scores, eight 
negative total recall scores, one a zero score. 
Since a zero recall score does not permit pre- 
diction of group membership, this S was 
omitted from the prediction analysis.” As 
shown in Table 2, the prediction of Strong 
Ego was found to be correct for seven of the 
nine positive total recall scores, the prediction 
of Weak Ego for six of the eight negative ones. 
Fisher’s exact test yields a P value of .045 for 
these data. The null hypothesis that the 
direction of selective recall is unrelated to 
personality structure, as measured in this 
experiment, can therefore be rejected. 

Selective recall of Strong and Weak Ego 
Groups within each orientation. The analyses 
from this point on are based on the results 
obtained by the Ss within the two experimental 
groups, ie, the nine Strong and the nine 
Weak Ego Ss as selected by the combined 
criteria of the self-rating scale and interview, 
described above. 

Differences in percentage recall scores ob- 


"This score was subsequently identified as the 
score of a Strong Ego S. 
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TABLE 3 
MEAN RECALL SCORES OF STRONG Eco AND 
Weak Eco Mate SUBJECTS TESTED UNDER 
Born TasK-ORIENTED AND 
EcGo-ORIENTED CONDITIONS* 


Task- _Ego-_ 
Orientation Orientation 
“In € ih C 
TE 7 57 26 46 
Strong Ego Ss k: 
Weak Ego Ss -29 -48 .31 .32 a 
` P* P* 
.07 -25 
e rN) 06 
In—C 45 -01 


a Percentage recall scores converted into angles by the arc 
i Sercentage i 27, Table 16.8, p. 449). 
n+/ percentage transformation (27, Ta 
T by means of Mann-Whitney U Test (14). 


tained by the Ss in the two experimental 
groups are shown in Table 3. Under Task 
Orientation, the Strong Ego group tends to 
recall more interrupted tasks than the Weak 
Ego Ss (P = .07). The Weak Ego Ss recall 
more completed tasks than do the Strong 
Ego Ss, but the difference is not statistically 
significant (P = .27). There is also no differ- 
ence in the so-called Zeigarnik effect, In — C 
(P = .45), though the difference is in the 
expected direction. 

Under Ego Orientation, Weak Ego Ss recall 
more interrupted tasks than do Strong Egos, 
but the difference is not statistically significant. 
Strong Ego Ss tend to recall more completed 
tasks than do Weak Egos, and this difference 
approaches significance (P = .06). The differ- 
ence in recall as measured by In — C for the 
two groups is significant (P = .01). In other 
words, the discrepancy in favor of the greater 
recall of completed than of interrupted tasks 
is greater for the Strong Ego Ss than for the 
Weak Ego Ss. 

Performance of the Sirong and Weak Ego 
Groups within each orientation. Tables 4, 5, and 
6 summarize the group differences for the 
performance variables. Table 4 shows that 
under Task Orientation, the two groups are 
equally efficient at solving the sentences. 
Under Ego Orientation, both groups lose in 

efficiency, the Strong Ego Ss losing less than 
the Weak Ego Ss, but the differences are not 
significant. i 

Since each solvable sentence ae four 
iitjons, a more sensitive measure of per- 
ane loss from Task Orientation to Ego 
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Orientation is afforded by the mean number of 
alternative solutions achieved under each 
condition. As Table 5 shows, the mean for 
Weak Ego Ss under Task Orientation is higher 
than for Strong Egos, but the difference is not 
significant. Under Ego Orientation, both 
groups show a drop. When differences between 
the two sessions are analyzed, the loss for the 
Weak Ego Ss is found to be significantly 
greater than for the Strong Ego Ss (P = .05). 

The poorer performance of Weak Ego Ss, as 
the instructions become more ego threatening, 
may result from increased rigidity, i.e., having 
once found a solution they are less able than 
Strong Egos to disturb that structure and to 
attain an alternative solution. The possibility 
of greater rigidity in Weak Egos is supported 
by the data concerning time to achieve a 
first solution. These data, summarized in 
Table 6, show no significant difference between 
groups in time required to achieve a first 
solution. In other words, Weak Ego Ss are 
not slower to find an initial solution to the 
problem when the instructions become more 
ego threatening (Table 6), but are significantly 
less able to restructure and achieve additional 
solutions within the two-minute time limit per 
sentence (Table 5). 

As Table 7 shows, there is no significant 
overall memory loss as the conditions become 
more threatening to self-esteen. The differences 
within each experimental session, moreover, 
do not even approach significance. 

The selective recall differences repor ted 
above, therefore, represent qualitative differ- 
ences as a function of ego strength, not quanti- 
tative differences in total memory. The per- 
formance data indicate both qualitative and 
quantitative group differences. Under objec- 
tively non-self-esteem threatening conditions, 
the performance of the two groups is the same- 
When self-esteem is objectively threatened, 
however, Weak Ego Ss show the greater loss 
in efficiency. 


Part II 


THE RELATIONSHIP BETWEEN EGO 
STRENGTH AND N ACHIEVEMENT 


In order to fit the data of Part I into the 
results of recall studies done within the Mc- 
Clelland ef al. (18) framework of achievement 
motivation, the relationships between % 
Achievement as a single parameter of per- 


a 
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P TABLE 4 
ANALYSIS OF VARIANCE FOR NUMBER OF SENTENCES SOLVED BY STRONG AND WEAK Eco COLLEGE MALES 
UNDER Two EXPERIMENTAL CONDITIONS 


Task Orientati D . ; Difference Between Orientations 
10n Ego Orientation (Ego — Task) 
| Sum of | Var- S f | Var- S f | Var- 
d k a um of | Var- um of | Var- 
X 4 Squares fiance á af Squares jiance af Squares |iance F 
Joe. 17| 10.3 Total 17 | 24.3 Total 17] 27.4 
eeween |1| 1 | A |.16) Between |1| 4.5 [4.5 [3.66] Between / 1) 5.5 [5.5 4.07 
rie | s Ss 
Within 16 | 10.2 -64 | Within 16 | 19.8 |1.23 Within 16 | 21.6 |1.35 
Strong Egos Weak Strong Egos Weak Strong Egos Wee 
Egos E Me Egos rong ag: Egos 
Mean 5.3 5.4 Mean 5.1 4.1 Mean —0.2 —1.3 
TABLE 5 


ANALYSIS OF VARIANCE FOR NUMBER OF ALTERNATIVE SOLUTIONS By STRONG AND WEAK Eco COLLEGE MALES 
UNDER Two EXPERIMENTAL CONDITIONS 


Difference Between Orientations 


Task Orientation Ego Orientation (Ego — Task) 
| Sum of | Var- Sum of | Var- Sum of | Var- 
af Squares fiance af Squares |iance of Squares fiance F 
Total 17 | 133.8 Total 17 | 73.8 Total 17 | 187.5 
Between 1 8.1 | 8.1 |1.03| Between 1 J6 |563 Beirona a 44.5 |44.5 |5.0* 
S Ss n 
Within 16 | 125.7 | 7.9 Within 16 | 68.2 | 4.3 Within 16 | 143.0 | 8.9 
Strong Egos ey Strong Egos veer Strong Egos a 
Mean 8.5 9.8 Mean 6.7 5.6 Mean —1.8 —4.2 
* Indicates significance beyonnd the .05 level of confidence. 
TABLE 6 


ANALYSIS or VA 


RIANCE FOR AVERAGE TIME IN SECONDS TAKEN BY STRONG AND WEAK COLLEGE MALES 
TO ATTAIN A First SOLUTION UNDER TWO EXPERIMENTAL CONDITIONS 


i ý Ego Orientati Difference Between Orientations 
Task Orientation Bgo Orientation (Ego — Task) 
y 
Sum of | Var- Sum of | Var- Sum of | Var- 
df Squares | iance af Squares | iance F af Squares | iance F 
17 [3766.54 Total |17 [7212.72 Total |17 [5304.37 
Tota a | 21:35 | 21.35}.o1 Between |1] 90:60 | 90.60].20] Between |1| 23:35 | 23.39.07 
s s 
win he [3745.19 [234.07] | Within ft6 [7122.12 |455.13] | Within [16 [5281.02 [330.06 
Weak Weak Weak 
Strong Egos| Egos Strong Egos Egos Strong Egos Esos 
an 68.7 70.9 Mean 78.3 82.7 Mean +9.6 +11.9 


the personality syndromes Ego 
Weakness have to be ex- 
chievement was not meas- 
a new experiment 


sonality and 
Strength and Ego 
amined. Since 7 A 
ured in the prediction study, 


was designed. The specific question it asks is: 
Do Ss who score high on 2 Achievement, as 
measured by the TAT method of McClelland 
et al. (18), more often fit only one of Alper’s 
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TABLE 7 
SENTEN ¢ STRONG AND WEAK Eco COLLEGE MALES 
SALYSIS si OR NUMBER OF SENTENCES RECALLED BY 
AEE R UNDER Two EXPERIMENTAL CONDITIONS 


| 3 m Diference Between Orientations 
Task Orientation | Ego Orientation (Ego — Task) 
E| Var- | p Sum of | Var- | z Sum of | Var- | p 
af ee | iance df | Squares | iance | F af Squaree | iance 
| \ 
; P | Total 17| 54.9 WEZ 17| 61.8 
Total x 30:2 2.7 \1.5| Between 1 13: 1.3 |.38} Between 1 32 2 |.05 
Between | | Ss | Ss 
wim hel 28.2 | 1.8 | | Within l6 | 53.6 | 3.4 | | Within [16] 61.6 | 3.85 
| Weak Weak Weak 
Strong Egos p iaia Strong Egos Eeo Strong Egos Egos 
| l 
Mean 5.4 | 47 | Mean 4.2 3.7 | Mean 1.2 -1.0 


two personality syndromes, and Ss who score 
low on n Achievement, the other? 

Although the Ss in this experiment were 
college females, rather than males, the sex 
factor should not affect the relationships being 
studied here since: (a) the design of the 
present experiment takes into account the 
conditions that should assure a high scoring 7 
Achievement group; and (b) there are no 
reports of sex differences in the recall of 
interrupted vs. completed tasks. Indeed, in 
the Atkinson and Raphelson (6) study, the 
authors included the recall results of 22 female 
Ss “to add stability to the recall scores of 
men” (6, p. 352). They do not report separate 
recall results for the female sample. 

Sex differences have been reported, however, 
in studies of n Achievement. McClelland et al. 
summarize them as follows: “(1) women get 
higher n Achievement scores than men under 
neutral conditions-two studies; (2) Achieve- 
ment-Orientation does not increase x Achieve- 
ment in women-three studies; (3) women’s 2 
Achievement scores seem as valid as men’s in 
that they relate to performance in the same 
way; (4) the failure to find an increase in n 
Achievement score from achievement arousal is 
probably not due to the fact that the scores 

are already at a maximum” (18, p. 178). The 
clue for arousing higher n Achievement scores 
in women is seen by McClelland et al. to 

come from a study by Field (10) who reported 
that n Achievement in women is tied up with 
social acceptability rather than with reference 
to leadership capacity and intelligence. In 
Field’s study, for example, females who were 

a ir classmates as “liked” scored 
rated by their cla 


higher on 2 Achievement than those rated as 
“disliked,” the ratings having been made 
after n Achievement had been measured and 
without advance knowledge that such ratings 
would be asked for. McClelland et al, con- 
clude from these data that “if you want to 
arouse 7 Achievement in women, refer, as 
Field did, to their social acceptability; if you 
want to arouse 2 Achievement in men, refer, 
as we did to their leadership capacity and 
intelligence” (18, p. 181). Since Morrison (20) 
has subsequently confirmed Field’s thesis by 
finding that female college leaders score higher 
on 2 Achievement than do female nonleaders, 
we may paraphrase McClelland et al., as follows: 
if you want to be assured of a high n Achieve- 
ment scoring female sample, use college leaders 
as Ss. Such students formed the experimental 
group in the present study. 

An added advantage of a leader group for 
Present purposes is the relationship reported by 
Atkinson and Raphelson (6) between selective 
recall and v Affiliation (“motivation for social 
acceptance” (6, p. 350)). Under a very relaxed 
orientation, Ss who scored high on x Affiliation, 
as measured by the TAT technique, recalled 
more interrupted than completed tasks, while 
Ss who scored low on Affiliation showed the 
opposite recall pattern. Both Atkinson (5) and 
Atkinson and Raphelson (6) report that SS 
who score high on m Achievement under 
Achievment Orientation, recall more inter- 
rupted tasks than do Ss who score low on 7 
Achievement. These findings taken together 
suggest the possibility that Ss who score high 
on # Affiliation and low on n Achievement 
might show the recall pattern of Alpers 


a 


= 


A 
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Strong Ego Ss; namely, greater recall of 
interrupted tasks under Task Orientation, of 
completed tasks under conditions which 
objectively stress that S’s performance is 
being measured. Similarly, Ss who score low on 
n Affiliation and high on x Achievement might 
be expected to show the reverse recall pattern 
characteristic of Alper’s Weak Ego Ss. The 
fact that the recall of completed tasks has 
been found in the past to be a function of in- 
creased experimental stress, not of the strength 
of S’s x Achievement, may function against the 
hypothesized relationships between Ego 
Strength, n Affiliation, 7 Achievement, and 
recall pattern. On the other hand, since 
Atkinson and Raphelson (6) do not report 
how high » Affiliation Ss scored on » Achieve- 
ment, though both measures were obtained 
for each S, nor have they tested the selective 
recall of the same Ss under both Task and 
Achievement Orientation, there is at present 
no clear contrary evidence against the hy- 
pothesis. 


Method 


The Ego Strength measure. The Psychological Insight 
Test, as described in Part I, was used to measure Ego 
Strength. Designed originally for use with college 
males, it was revised for female Ss only by changing 
_imale’ to “female,” “men” to “women,” “his” to 

her,” etc., in the directions and in the body of the 
Scale. As in Part I, it was taken under “take home” 
Conditions. In the instructions, Æ attempted to keep 
the conditions as nonthreatening to self-esteem as 
Possible by emphasizing both that names would be re- 
moved after coding and that the results would be used 
as part of a study of college students in general, not 
as tests of the participating individuals. 

The algebraic scoring scheme, as described in Part 
I, was used to determine S’s score on each of the nine 


variables measured. To fit the Strong Ego pattern, S's 
e scores on # Recogni- 


Self-ratings would involve positiv J 
tion, » Dominance, Ego-Strength and Conative Con- 
Junctivity, negative scores on Dejection ves 
and Ego-Ideal Intragression. To fit the Weak Ego 
Pattern, §’5 self-ratings would involve negative Se 
on # Recognition, n Defendance, # Counteract: 

Achievement and Narcism, positive scores on ee 
tion-Pessimism, and Ego-Ideal Intragression. Ihe 


S r — pattern of scores, 
trong Ego, thus, would havea GA Fi "i pattern can 


the Weak Fy —, + pattern 
ak Ego a P 1 
be regarded as an inconsistent Strong Ego parten a 
=, — pattern an inconsistent Weak Ego pa' : 
18 Tt is quite likely that such instructions = 
more ego-threatening to some, less to others. o ia 
them nonthreatening to self-esteem, then, is to s 


2's inte ’s involvement. 
nt, not S’s involver ý i ù 
3 Quantitative scores were obtained, as well as this 


Pattern score, by following the scheme described in 
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„The n Achievement measure. The choice of picture 
stimuli for measuring n Achievement was influenced 
by: (a) the finding by Veroff et al. (28) that pictures 
of male characters yield higher » Achievement scores 
than pictures of female characters for both male and 
female college Ss; (b) the writer's clinical experience 
with the usefulness of particular TAT pictures for 
evoking achievement needs; and (c) an interest in 
measuring parent-child relationships as perceived by 
high and low scoring Ss on the » Achievement variable. 
These criteria resulted in the selection of four pictures: 
Cards 1 and 7GF from The Thematic Apperception Test 
(22), i.c., boy and violin, and young girl and mother- 
figure, respectively, and Cards 10G and 11G from The 
Michigan Picture Test (19), i.e., young girl and father- 
figure, and young girl in empty classroom. The 2 
Achievement scores reported here are based only on 
two of these, the boy and violin, which are listed among 
the McClelland eż al. (18) stimuli for measuring » 
Achievement, and the girl in classroom which can be 
considered a female counterpart of their Card H, “a 
boy in checked shirt at desk, an open book in front of 
him” (18, p. 375). Since the young girl is typically seen 
as about ten years old, as is also the case for the boy 
in the the boy and violin picture, when the storyteller 
mentions age at all, these two pictures are particularly 
well matched for tapping reactions to sex differences 
of the stimuli. 

To facilitate group presentation of the pictures, 
permission was obtained from the respective publishers 
to make the necessary slides as well as certain changes 
in the stimuli. For example, in Card 11G, the back- 
ground of the empty classroom was blurred, giving the 
background less “an empty classroom” appearance. 

Experimental design. Three groups of college females 
were used as Ss in this experiment, an experimental 
group and two control groups. The experimental group 
was comprised of 28 upperclassmen, all of whom had 
been elected to high office by their peers. Achievement 
motivation was tested by a college senior, herself a 
college leader.” The conditions of testing and the 
instructions were casual, the intention being merely to 
task-orient these Ss. To this end, the group was told 
by the student Æ that “some studies of college students 
were being made and that they had been asked to help.” 
A second group, 24 upperclassmen in an advanced 
course in psychology, served as a sample of upperclass 
nonleaders. Office holders in the group were eliminated 
from the sample. The test was administered by the 
instructor in the course immediately following the 
completion of an hour examination in the course in 
which these students were enrolled. The instructions 
emphasized that the test measured personality and 
that it would be scored by the instructor. This group, 
therefore, was an ego-oriented group and was tested 
immediately following a real-life achievement situation 
under ego-oriented conditions. The third group was 


Part I. Only pattern scores are considered here since 
the concern is merely with presence or absence of a 
given self-rating pattern in relation to the » Achieve- 
ment variable. 

20 The writer wishes to thank Miss Maud Hazeltine, 
President of Student Government at Wellesley College 
in 1955-56, for administering the test to the leader 


group. 
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rised of 24 students enrolled in an introductory 
ee in psychology. These Ss, largely sophomores, 
constituted another nonleader sample. Their achieve- 
ment motivation was measured during a regular class 
hour under casual task-oriented conditions. The in- 
structions emphasized that this was merely a demon- 
stration of a psychological technique. Since demon- 
strations were frequent in the course, this device was 
used for task-orienting these Ss. a 

In all three groups, the test was administered under 
group testing conditions. The McClelland et al. (18, 
i 98) instructions and questions followed the special 
Bee cians, task- or ego-oriented, for the given group, 

bove. 
j an predictions were made: : 

1. Following Morrison (20), the mean x Achieve- 
ment score of the task-oriented leader group will be 
significantly higher than the mean x Achievement score 
of the ego-oriented nonleader group and of the task- 

i ader group; 
ie Veroft a (28), the mean 7 Achieve- 
ment score of the ego-oriented nonleader group will 
not differ significantly from the mean 7 Achievement 
score of the task-oriented nonleader group. 

3. Following Atkinson (5) and Atkinson and 
Raphelson (6), in the leader group only, Ss whose self- 
ratings fit the Strong Ego pattern will more often score 
low on n Achievement than high on » Achievement, 
while Ss whose self-ratings fit the Weak Ego pattern 
will more often score high on n Achievement. The pre- 

diction was limited to the leader group since only this 
group meets the criterion of high » Affiliation required 
by the Atkinson and Raphelson (6) recall findings as 
well as Morrison’s (20) n Achievement findings for 
female Ss. 


Results 


Scoring of Ego Strength and of n Achieve- 
ment was done by the writer. Since the Ego 
Strength measure entails merely arithmetical 
addition of Ss’ self-ratings, it presents no 
problem with respect to scorer reliability, a 
problem which arises, however, with the 7 
Achievement measure. Two steps were taken 
to check on the reliability of the writer’s 
scoring of n Achievement. First, several 
months after the initial self-instruction period 
with the McClelland et al. Scoring categories 
(18, Chap. IV), the writer rescored the first 
story of each of the 30 Ss included in their 
Appendix I. A rank-order correlation test of 
the writer’s scores checked against the scores 
reported for these stories by McClelland et al 
(18) in their Appendix IT yielded a tho value 
of +.85. Secondly, to check the writer’s 
self-consistency for scoring Achievement, 
she rescored the stories told by the leader 
group one month after her original Scoring. 
The rho value here was +.96. Since these 
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TABLE 8 
MEAN n ACHIEVEMENT SCORES OF COLLEGE 


FEMALES UNDER VARIOUS ORIENTATIONS 


(Stimuli Were TAT Card I and Michigan 
Picture Test Card 11G) 


eee Nonleaders, Nonleaders, k | 
task-oriented €80-oriented task-oriented »_ 
Mean n 3.1 4.1 3.5 
Achievement 
Score : 
ig F | 
College leaders — non- —.85 50 .50 — .40 A 
leaders, ego-oriented f: 
College leaders — non- —.34 50 .80 — .70 | 
leaders, task-oriented 
Nonleaders, ego-oriented— —.55 46 .60 — .50 


nonleaders, task-oriented 


values of rho compare favorably with other 
reliability tests cited by McClelland et al. fy 
(18, pp. 185-187), it is assumed that the \ 
scoring is reliable. "o 

Analysis of the total mean n Achievement 
scores based on stories told to two pictures (TAT 
Card 1 and Michigan Picture Test Card 11G). í 
The mean # Achievement scores for the three s 
groups of Ss are shown in Table 8. The data F 
indicate that the three groups do not differ on | 
n Achievement. The first prediction, that the 
leader group would score higher than the non- | 
leader groups, is therefore not supported. In { 
fact, on an absolute basis, the leader group > ! 
scored lowest, the ego-oriented nonleader 
group scored highest, the task-oriented non- 
leader group in the middle, though none of the 
differences are significant. 

Since the mean # Achievement scores of the 
two nonleader groups do not differ, the second 
Prediction is supported. The fact that one 
group was ego-oriented during the testing and 
the other task-oriented, in other words, does 
not affect their n Achievement scores. 
Analysis of the effect of the sex of the character 
the picture stimulus on n Achievement scores. 
To check the possibility that the three groups 
may have been responding differently to the 
two pictures, separate # Achievement scores 
were computed for each picture. Dependent t 
tests, comparing the scores on the two pictures 
within a group, yielded the results summarized 
in Table 9. These data indicate that the two | 
pictures are not equally effective in arousing 
n Achievement for all groups. The male N 
picture arouses significantly more 2 Achieve- ji 
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‘ment in the leader group than does the female 
picture, the mean of the former being 2.6, of 


the latter 0.7 (P < .05). The differences are 


in the same direction for the ego-oriented 
nonleader group, but they are not significant 
(P < .20). For the task-oriented nonleader 
group there is a reversal: the female picture 
tends to yield a higher mean » Achievement 


score than the male picture (P < .10). 


In view of these findings, the first two pre- 
dictions were tested for each picture separately. 
The results, analyzed by means of independent 


_» tests, are summarized in Table 10. 


As is shown in the left half of Table 10, 
Prediction 1 is partly supported for the male 
picture: » Achievement is significantly higher 
for the leader group than for the task-oriented 
nonleader group (¢ = 7.96; P < .001), but 
there are no differences between the leader 
group and the ego-oriented nonleader group 
(t = —.53; P < .60). 

Prediction 2 is zof supported since the mean 
n Achievement score of the ego-oriented non- 
leader group is significantly higher than the 
mean # Achievement score of the task-oriented 
nonleader group (t = 2.04; P = .05). 

The right half of Table 10 summarizes the 


TABLE 9 
MEAN » ACHIEVEMENT SCORES OF COLLEGE 
FEMALES BASED ON STORIES TOLD TO A 
MALE AND TO A FEMALE PICTURE 


STIMULUS 
Fe- 
Mal 
Group Yue lmale | p | df | P 
ture ture 
College leaders, 2.6 .7 | 2.42] 27 05— .02 
task-oriented S 
Nonleaders, ego- | 3-0 | 1-4 1.47| 23 |.20—.10 
oriented = B 
Nonleaders, task- | 1-3 2.3 |—1.92} 23 |.10—.05 
oriented 
TAB 


D CHIEVEMENT SCORES OBTAL 
iia PICTURE 
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results for the female picture. Again the leader 
group does not differ from the ego-oriented 
nonleader group (t = —.56; P < .60) but 
does differ from the task-oriented nonleader 
group. The female picture yields a significantly 
lower mean n Achievement score for the task- 
oriented leader group than for the task- 
oriented nonleader group (¢ = —2.16; P = 
.05). The two nonleader groups also differ on 
the female picture, the ego-oriented nonleader 
group scoring significantly lower on # Achieve- 
ment than the task-oriented nonleader group 
(t = —5.43; P < .001). 

In brief, these results indicate that: (a) the 
sex of the character in the picture stimulus 
does affect the strength of the n Achievement 
score of college females; (b) # Achievement is 
more likely to be aroused in a female S by a 
male than by a female character if S either 
values social acceptance or is being tested in 
an achievement atmosphere; (c) # Achieve- 
ment is more likely to be aroused in a female 
S by a female than by a male character if S 
does not value social acceptance and she is 
being tested in a task-oriented atmosphere. 

Relation between Ego-Strength patterns and n 
Achievement. The Psychological Insight Test 
scores were labeled Strong Ego, Weak Ego, 
Not Consistently Strong, or Not Consistently 
Weak according to whether the score fitted a 
+, —;a —, +;a +, +; ora —, — pattern, 
respectively, as described earlier. The fre- 
quency of these patterns within the leader 
and nonleader groups is shown in Table 11. 
The two nonleader groups were combined here 
since a chi-square analysis revealed them not 
to be different (x2 = .75; P < .90). 

In all groups, the Weak Ego pattern occurs 
rarely. The Strong Ego pattern occurs most 
frequently, but not more frequently than 
would be expected by chance alone when 


LE 10 


NED BY COLLEGE FEMALES ON STORIES TOLD TO A MALE AND TO A FEMALE 


STIMULUS 


a Male Picture 


Female Picture 


Group 7 7 F = : z > 
College leaders, task-oriented—nonleaders, —.53 50 .60 — .50 = (86 50 60 — .50 
~orient S j 
Célloge kader task-oriented—nonleaders, 2.04 50 05 216 50 05 
task-oriented ee an i pom y a R 


Nonleaders, ego-oriented—nonle 


task-oriented 


160 


TABLE 11 
PATTERNS OF PSYCHOLOGICAL INSIGHT TEST 
SCORES OBTAINED BY FEMALE COLLEGE 
LEADERS AND FEMALE COLLEGE 
NONLEADERS 


Psychological Insight Test Patterns 


Not Con- Not Con- 
Gre N Strong Weis sistently sistently 
Ego go Strong Weak 
6 
Leaders 28 16 2 4 
Nonleaders 46 21 6 Gg 4 
TABLE 12 


RELATIONSHIP BETWEEN EGO STRENGTH AND 
n ACHIEVEMENT OF FEMALE COLLEGE 
STUDENTS 


(Analysis by chi square*) 
| 


No. Scor- | No. Scor- 
ing Above| ing Below 
yis n ae n 
Achieve- chieve- 2 
N ment ment ji e 
Mean of | Mean of 
the Total | the Total 
Group Group 
Strong Ego Ss 16 3 13  16.31|<.02 
Weak Ego Ss 2 0 2 — | = 


® The correction for continuit: 


y was applied to these data. 
*df=1. 


Strong Ego frequency is compared with the 
frequency with which all other patterns com- 
bined occur. 

Table 12 shows the number of Strong 
Ego Ss in the leader group whose 7 Achieve- 
ment scores fell above the group’s mean, and 
the number whose scores fell below the group’s 
mean. Similar data are included for the Weak 
Ego Ss. 

The Strong Ego data indicate that signifi- 
cantly more Ss in the leader group whose 
self-ratings fit the Strong Ego pattern score 

below the mean of the group 
(P= 


The prediction 
leader group would score hi 


1 since th 
so infrequently in thi 


A prediction of ho 
leader groups would 
group means was 
Achievement literat 


€ pattern appeared 
S group. 
w Strong Ego Ss in non- 
Score with respect to their 

not made since the 7 
ure provided no basis for 
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making a prediction. The fact that in both 
nonleader groups as many Strong Ego Ss 
scored above their group mean as below it, 
however, strengthens the importance of the 
the inverse relationship between » Achieve- 


ment and Ego Strength found in the leader 
group data. 


Discussion 


The results reported in Part I of this paper 
support the position taken earlier by the 
writer (1, 2, 3) that selective recall is lawfully 
related to the personality structure of the 
recaller. In the present study, two personality 
patterns, the Strong Ego and the Weak Ego, as 
previously (2) described, were studied further 
by selecting in advance of the experiment Ss 
who fitted these patterns. All Ss were tested 
under two experimental conditions, Task 
Orientation first, followed by Ego Orientation 
a week later. It was found that correct pre- 
diction of group membership on the basis of 
the selective recall pattern alone was possible 
significantly beyond chance: the Strong Ego 
pattern consisting of greater recall of completed 
than of interrupted tasks as the experimental 
conditions become more threatening to self- 
esteem; the Weak Ego pattern, of greater 
recall of interrupted than of completed tasks- 

As in previous papers (2, 3), these results 
are interpreted to mean that Strong and Weak 
Ego Ss protect self-esteem in different way: 
In situations in which completion is likely to Þe 
experienced as personal success, interruptio™ 
as Personal failure, ie., when tasks are being 
performed under eg 
is not overwhelmed by the failure 
trong Ego supports bis 


Weak Ego utili 


zes an ego-offensive mechanis™ 
(2): he focuses 


on his failures, i.e., recalls more 
interrupted than completed tasks, admitting 
his failures before others face him with having 
failed. Both types of recall, then, serve t° 
reduce ego tensions under conditions exper” 


mentally designed to arouse ego, rather tha? 
task, tensions. 


Similar recal] 
very different 
recently by Atk 


Patterns, but interpreted 
Ways, have been reporte 
inson (5) and Atkinson 2” 


E 
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Raphelson (6). The data of Part II support a 
reinterpretation of their findings. As a result, 
the x Achievement and Ego Strength studies 
can be shown to have yielded uniform patterns 
of selective recall. 

Atkinson and Raphelson (6) have reported 
that Ss who score high on z Affiliation (moti- 
vation for social acceptance) as measured by 
the TAT technique of McClelland ef al. (18) 
under | the “very relaxed” orientation, recall 
more interrupted tasks than do Ss who score 
low on x Affiliation. As the conditions become 
more achievement oriented, the direction of 
selective recall correlates with z Achievement, 
not with z Affiliation. The Ss who score high 
on 2 Achievement tend to recall more inter- 
rupted than completed tasks, while Ss who 
score low on 2 Achievement recall more 
completed than interrupted tasks (5, 6). They 
interpret these results to mean that the S who 
recalls more interrupted tasks under relaxed 
conditions is reacting merely with task tensions. 
He wants to please the Æ and hence, like 
Zeigarnik’s (29) Ss, when a task tension can- 
not be resolved, because Æ has interrupted S on 
the task, the tension persists, making inter- 
rupted tasks more available for recall than 


completed ones. The greater recall of 
interrupted than of completed tasks by high 
rn by low # 


n Achievers, and the reverse patte: 
Achievers, under achievement-oriented con- 
ditions are explained as follows: high 7# 
Achievers have a “positive disposition to 
excel—are predominantly success-oriented and 
have as their goal the feeling of satisfaction 
accompanying personal accomplishment while 
persons lower in 7% Achievement are more 
concerned with avoiding feelings of failure” 
(5, p. 386), i.e, “lower n” Achievement scores 
imply relatively greater anxiety about failure” 


(5, p. 389). . 
Thus, recalling more of one’s failures 
(interrupted tasks) under achievement- 


oriented conditions, the pattern of the S who 
scores high on n Achievement, is interpreted 
by Atkinson to express S’s “positive dispo- 
sition to excel,” by Alper to mean that the 
Weak Ego, overwhelmed by the experience of 
failure, protects self-esteem by taking the 
offensive; he admits his failures before others 
accuse him of having failed. Recalling more of 
one’s successes (completed tasks) under 
achievement-oriented conditions, the pattern 
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of the low » Achiever, is interpreted by 
Atkinson (5) as an expression of anxiety over 
failure and by Alper as a way of supporting 
self-esteem through focusing on tasks one did 
complete. Seemingly opposite defense patterns, 
in other words, are seen to be operating to 
produce the same recall patterns. 

The results in Part II of this paper provide 
a basis for showing how the two interpretations 
may be brought into a common theoretical 
framework. By obtaining both Ego Strength 
and 2 Achievement measures on female 
college leaders, a group which according to 
previous studies should score high on 2 
Achievement (10, 20), it was found that Ss 
whose self-ratings fit the Strong Ego pattern 
more often score below the group mean on # 
Achievement than above it. No relationship 
was found between # Achievement and the 
Strong Ego self-rating pattern in two control 
groups. Since, by definition, the leader group 
should value social acceptance, i.e., score high 
on Affiliation, we now can relate the Atkin- 
son (5) and Atkinson and Raphelson (6) 
findings to the Strong Ego selective recall 
patterns isolated by the writer. The greater 
recall of interrupted tasks under the Relaxed 
Orientation by Ss who score high on x Afħili- 
ation (6), and of completed rather than in- 
terrupted tasks under conditions which 
objectively stress achievement by Ss who 
score low on n Achievement (5, 6), is exactly 
the recall pattern expected of a Strong Ego 5S. 

While the relationship between high x 
Achievement and the Weak Ego self-rating 
pattern could not be adequately tested in 
Part II because so few Ssin the leader group 
fitted the Weak Ego pattern, the inverse 
relationship between the Strong Ego self- 
rating pattern and 7 Achievement permits at 
least a partial reinterpretation of the role # 
Achievement plays in selective recall. The 
results of Part II do not support Atkinson’s 
(5, p. 389) proposition that “lower n Achieve- 
ment scores imply relatively greater anxiety 
about failure” if low 2 Achievement is com- 
bined with the personality structure of a 
Strong Ego S. On the contrary, these Ss are 
able to counteract the objective failure threat 
(the Strong Ego Ss in Part I obtained signifi- 
cantly more alternative solutions than did 
Weak Ego Ss). Counteraction suggests 
positive, constructive forces at work rather 
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than a mechanism of “anxiety-avoidance.” 
In other words, it may be the high n Achievers 
as well as Zeigarnik’s (29) “ambitious” Ss, not 
the low 2 Achievers, who have the relatively 
greater anxiety about failure. , , 
Further support for these interpretations 
comes from an analysis of the self-ratings of the 
leader group on the nine individual personality 
variables included in The Psychological Insight 
Test. As a group, these socially accepted Ss 
significantly more often rated themselves high 
rather than low, i.e., above the average for 
college women, not below the average, on n 
Counteractive Achievement, Conative Con- 
junctivity and Ego Strength, and low rather 
than high on x Recognition, Dejection- 
Pessimism and Ego-Ideal Intragression. The 
variables on which they score equally often at 
the high or the low end are Narcism, n Defend- 
ance, and 7 Dominance. In this group, the Ss 
accept as characteristic of themselves, then, 
only those variables which could be regarded as 
having a positive value in our culture. This 
finding is consistent with Martire’s (17) 
finding that low » Achievers are less dissatis- 
fied with themselves, less in “self-conflict,” 
than are high 7 Achievers. 

The low 2 Achiever who is a Strong Ego, in 
other words, is low on achievement motivation 
as long as the objective situation is unrelated to 
achievement. His Achievement needs are 
not constantly under tension. Yet when achieve- 
ment is realistically called for, and failure isa 
realistic possibility, the Strong Ego can and 
does counteract it. The counteraction shows 
itself both in his selective recall and in his 
performance data. It should not be surprising, 
therefore, to find significant differences in 
selective recall between high and low x 
Achievers consistently reported only for 
Achievement Orientation (5, 6), while per- 
formance studies of high and low 2 Achievers 
continue to yield inconsistent results (6, 7, 12, 
27). In some of these studies, high 7 Achievers 

n Achievers under 
and Achievement 


r | Ounteractive 
to differentiate, for example 
m Achievers who a : 


i re high 
Achievement and high k 5 
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Achievers who are low on z Counteractive 
Achievement. Moreover, while low # Achieve- 
ment and high » Counteractive Achievement 
appears to be the Strong Ego pattern, theo- 
retically, Weak Ego Ss could fall either into a 
low n Achievement, low 7 Counteractive 
Achievement or high x Achievement, low 7 
Counteractive Achievement group. The two 
Weak Ego Ss in the present leader group, for 
example, would fit the first pattern since they 
rated themselves low on n Counteractive 
Achievement and scored low on n Achievement 
as measured by the TAT technique. 

While the relationships between low n 
Achievement and Strong Ego self-ratings hold 
Statistically, and thus Warrant the reinter- 
Pretation of the selective recall findings of 
studies done within the 7 Achievement frame- 
work, certain problems in the data of the 
present study, and certain objections which 
can be raised concerning them, need to be 
considered. The first problem centers around a 
difficulty raised in Part I. The recall patterns of 
Strong and Weak Ego Ss were not Statistically 
different under Task Orientation, though the 
overall differences between the sessions were 
Significant. The earlier study (2) had indicated 
that both orientations would yield significant 
recall differences within sessions. The absence 


of significant recall differences under Task 
Orientation js 


Study (see 3, 
explanation is 
have been j 
also have affected 


h the instru 
those Previously us, 
the Ss were familia 
with the clinic pe 
clinic many times 
i an extensive 


Ctions were the same as 
ed. In the earlier study (2), 
r with the clinic setting and 
tsonnel. They came to the 
since all were participating 
Personality study. In the 
t, the Ss served only in the 
If this explanation has 
s that at a given point in 
nuum, ego tensions begin to 
g Ego Ss also, and at this 
no differences in the recall 
Patterns of the two groups of Ss. At a further 


point along the continuum, the recall patterns 
would again differ, 
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A second problem in relating the recall 
patterns emerging in the »# Achievement 
studies to Alper’s patterns, stems from possible 
differences in an “Achievement” vs. an “ego 
oriented” or “self-esteem threatening” atmos- 
phere. While the terms themselves do not 
necessarily indicate important motivational 
differences, the writer would agree that so 
long as the conditions used to arouse the 
motive center around achieving, i.e., making a 
good record for oneself against some objective 
standard of excellence, the term “Achievement 
Orientation” expresses very precisely the 
motivation the Æ intends to have aroused. Yet 
in both the Atkinson and in the Alper studies, 
the arousal of more than achievement needs 
seems to have been intended since in both, the 
S is prevented from doing well on (i.e., com- 
pleting) “tests.” In the Atkinson study, S was 
told that the tests were “developed by psychol- 
ogists during the war yeras in order to select 
people with high executive capacity, intellectual 
alertness, the capacity for making quick 
decisions, and leadership” (5, p. 382). In 
Alper’s study, the tests were presented as “a 
brief intelligence test which the Army was 
finding useful in selecting candidates for 
Officer Training School” (2, p. 106). In other 
words, in both studies it was intended that S 
experience failure and that self-esteem be 
threatened. It is quite possible, of course, that 
Alper’s instructions would be more threatening 
to draft age males than Atkinson’s, if only 
because the incompletion occurred under more 
public conditions in the one than in the other; 
in Alper’s study, an accomplice, of college age, 
“succeeded” on tasks which S was experi- 
mentally doomed to fail. If these differences 
affect recall, however, theoretically they should 
sharpen rather than alter the direction of recall 
differences. 

A third problem arises with respect | to 
whether the correlation between self-rating 
scores and Achievement scores (the procedure 
followed in Part IL) should be taken at face 
value since recent studies of the relationship 
between self-rating scores on achievement 
(» Achievement) and TAT measures of 
achievement motivation ( Achievement) 
are not entirely consistent. Two studies (8, 9) 
have used male Ss, one (20) has used female Ss. 


For male Ss, de Charms ¢t al. (9) report a 
Positive correlation between ? Achievement 


and Achievement, the P value they obtained 
being <.05. But Child eż al. (8) found no 
relation between v Achievement and z Achieve- 
ment in their study of college males. For a 
female sample of college leaders, Morrison (20) 
reports that v Achievement and # Achievement 
are not correlated, while in the present study 
of female college leaders, there was a significant 
inverse relation between self-rating scores and 
n Achievement for the Strong Ego Ss (P < 
.02). 
The only other data available for comparing 
the relationships between v Achievement and 
n Achievement center on the F-scale scores on 
males in the de Charms ef al. study, and on 
females in the present study. For males, de 
Charms ef al. (9, p. 416) report that v 
Achievement correlates positively with total 
score on the F scale, while # Achieve- 
ment does not. In the present study, ^ 
Achievement was also not related to F-scale 
scores, but self-rating scores were: Strong 
Ego Ss tended to score below the group 
mean on the F scale (P < .10), while Ss 
who did not fit the Strong Ego pattern, 
treated as a group, significantly scored above 
the group mean on the F scale (P < .05). In 
other words, it is the non-Strong Ego Ss who 
are the more conformist, the more rigid Ss 
among these college leaders, including the 
two Ss whose self-rating scores fitted the 
Weak Ego pattern. The interpretation in 
Part I, that Strong Ego Ss were less rigid than 
Weak Ego Ss, as measured by their ability to 
obtain more alternative solutions in the 
scrambled phrases task, is consistent with the 
F scale findings in Part II and with the data 
in the Child et al. (8) study. 

Since many variables have been shown to 
influence the strength of the n Achievement 
variable, including the factor of the sex of the 
picture stimulus, further experimentation is 
necessary before the differences in findings of 
various studies can be clearly understood. The 
specific pictures used, the experimental 
atmosphere in which z Achievement is meas- 
ured, even the questionnaire items”! used in the 
self-rating measurements have varied greatly 


21 Tn all studies, the items have been derived from 
the Murray et al. (21) approach to measuring need 
strength by the questionnaire method. The length of 
the scales and the specific items covered have differed, 


however. 
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from one study to another. On the other hand, 
the relationships among these variables Te- 
vealed in the present study are consistent 
with the writer’s interpretation that Strong 
Ego Ss score low on n Achievement essentially 
because their achievement needs are not under 
tension rather than because of concern “with 
avoiding feelings of failure” (5, p. 386). This 
interpretation is an essential link for relating 
the selective recall pattern of the low x 
Achiever in the Atkinson (5) and Atkinson and 
Raphelson (6) studies to Alper’s Strong Ego 
pattern and thus for reaffirming both the 
theoretical framework presented earlier by the 
writer (2) and its applicability to previously 
reported contradictory or equivocal findings of 
other studies (3). 


SUMMARY 


Two experiments are reported here. The 
first was designed to validate the two major 
recall patterns that had emerged in a previous 
study of the relationship between personality 
structure and the recall of interrupted vs. 
completed tasks, the Strong Ego pattern and 
the Weak Ego pattern (2). For this purpose, 
two groups of college males, a Strong Ego and 
a Weak Ego group, were selected in advance 
of the experiment by means of a specially 
constructed self-rating scale and a clinical 
interview. Otherwise, the design of the experi- 
ment, procedure, and materials were the same 
as in the previous study. Using only the total 
recall score pattern as a measure of the change 
in direction of recall from a task-oriented to an 
ego-oriented session, it was possible to predict 
correctly the group membership of these Ss, as 
Strong or as Weak Ego Ss, significantly 
beyond chance. These results confirm, there- 
fore, the earlier finding that Strong Ego Ss 
recall significantly more completed tasks as the 
experimental conditions become 
ing to self-esteem; Weak 
tasks. In the present study. 
of the two groups did no 
within the task-oriented 
within the ego-oriented session, 


: 8S reported þ 
Atkinson (5) and Atkinson and Rapheler, 


(6). These authors have reported that high z 
Achievers, as measured by the TAT method of 


McClelland ef al. (18), recall more interrupted 

than completed tasks as the experimental 

conditions become more clearly achievement- 
oriented, while low x Achievers show the 
opposite recall pattern. 

Three groups of female Ss were used, and 
both # Achievement and Ego Strength were 
measured. One group was made up only of 
college leaders who had been elected to high 
office by their peers. Following Morrison (20), 
these Ss should be a high » Achievement 
Scoring group and could serve, therefore, as 
the experimental group. Two other nonleader 
groups served as controls. A significant in- 
verse relationship between Strong Ego self- 
rating scores and n Achievement was obtained 
only for the leader group. The two measures 
were not correlated in the control groups. 
These findings are used to show that the recall 
pattern of the low x Achiever, as reported by 
Atkinson (5) and Atkinson and Raphelson (6), 
corresponds to Alper’s Strong Ego pattern. 
The usual interpretation of a low x Achieve- 
ment score as implying “anxiety over failure’ 
is re-examined in the light of these findings. 

It cannot be concluded from these data, 
however, that the selective recall pattern of a 
high z Achiever would necessarily fit Alper’s 
Weak Ego pattern. Not only were there too few 
Weak Ego Ss in the college leader sample to 
test the relationship between Ego weakness 
and 2 Achievement, but also, and of more 
importance from the point of view of 
theory, Ego Strength and Ego Weakness, 
as measured in this study, are not 
at opposite ends of a single personality 
continuum. We cannot assume, therefore, that 
Ego Strength and Ego Weakness would 
function with respect to 2 Achievement in 
diametrically opposite ways. Yet given a high 
n Achiever whose self-ratings are congruent 


with the Weak Ego personality pattern, 
Including low 72 Counteractive Achievement, 
it woul 


d be consistent with the results of these 
experiments to expect him to show the selec- 
tive recall and performance patterns character- 
istic of Weak Ego Ss. 

The theoretical framework presented earlier 
by the writer (2) is thus reaffirmed. The useful- 
ness of this framework for reconciling 
seemingly Contradictory selective recall and 
performance data reported in other studies, as 
previously discussed (3), is also reaffirmed. 
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Specific findings concerning # Achievement 


in females, including the importance of the sex 
of the picture stimulus used and the experi- 
mental atmosphere in which # Achievement is 
measured, are discussed. 
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NSHIP BETWEEN ADVANCED AGE AND MENTAL 
THE RELATIO C ETETTIES 


CHARLES R. STROTHER, K. WARNER SCHAIE, anp PAUL HORST 
University of Washington 


N RESEARCH on aging, longitudinal studies 
in which the same population is followed 
over a long span of years are obviously 

to be preferred. There are an increasing number 
of such studies covering the periods of child- 
hood, adolescence, early and middle adulthood 
(1, 2, 5), but a paucity of longitudinal data is 
as yet available on older age groups. The prin- 
cipal difficulties involved in such studies are 
the long period of time required for completion 
and the unavoidable attrition as the original 
subjects advance in age, due to such factors as 
mobility, illness, and death. 

Cross-sectional studies, on the other hand, 
in which population samples in different age 
groups are compared, are open to several 
criticisms (3, 10). One criticism is concerned 
with the differences in educational background 
that are likely to characterize different age 
groups, usually to the advantageof the younger 
group. A second criticism is directed toward 
differences between age groups with respect to 
range of talent. For various reasons, it is diffi- 
cult to obtain comparably broad samples of 
older and younger populations. Older subjects 
are more frequently drawn from institutions 
for the aged or from small groups of volunteers, 
while the younger subjects represent broader 
segments of the population. There are also 
differences in motivation and in attitudes 
toward, and familiarity with, psychological 
tests, which may tend to favor the younger 
groups. Finally, while the disabilities of older 
people with respect to speeded tests have 
been well established, few attempts have been 
made to apply appropriate corrections or to 
employ tests where speed is not a factor. 

While these difficulties are serious, they may 
be minimized by restricting the groups com- 
pared to a fairly narrow age range. In the 
present study, the age range has been limited 
to the period 70-88 years. This restriction 
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made possible the selection of subjects who 
were relatively homogeneous with respect to 
educational background, socioeconomic status, 
motivation, and familiarity with psychological 
tests. In this restricted population, differences 
in “primary mental abilities,” “memory,” 
“Psycho-motor speed,” and “motor rigidity” 
were determined by appropriate tests. Cor- 
rection factors were then applied to the mental 
abilities scores to control the effects of loss of 
memory, reduced speed, and increased rigidity. 


METHOD 
Subjects 


All subjects (Ss) were volunteers obtained from 
the membership of a retired university faculty group 
or from among retired academic or professional workers 
who had responded to appeals in newspapers published 
in the city in which the university was located. The 
Ss were interviewed before being accepted for the re- 
search project, at which time it was explained that 
about eight hours, spread over several days, would be 
required for the physical examinations, psychological 
tests, and interviews involved in the larger project of 
which the present study is a part. At this point, ae 
was given an opportunity to withdraw from the ay 
if he wished. It seems reasonable to conclude that ted 
Ss who participated were at least as highly motiva! 
as the average volunteer group. 

The nic consisted of 25 male and 25 female col- 
lege graduates, ranging in age from 70 to 88 years, 
with a mean age of 76.5 years. Clinical and laboratory 
examinations indicated that all members of the group 
were in fair to superior physical condition for their age- 
No psychiatric complaints or symptoms were noted by 
the examining physicians. 


Tests 


In order to obtain estimates on more specific amit 
of mental ability than would be yielded by a ee: 
score, Thurstone’s Primary Mental Abilities Tes 
(Intermediate Form) (12), which provides separate 
scores for Verbal-meaning, Space, Reasoning, Number 
and Word-fluency factors, was selected. Previous noe 
(6) had shown that the reliability and the range ° 
difficulty for older subjects were adequate. am 

Since age changes in speed, memory, and rigidity 
are known to occur and might be expected to have some 
relationship to performance on the Primary Mental 
Abilities Test, measures of these variables were ob- 
tained. The Wechsler Memory Test (13) was selected 
as a measure of memory loss and Schaie’s Test of Be- 
havioral Rigidity (9) was used to obtain measures of 
“psycho-motor speed” and “motor rigidity.” 

All tests were administered as individual tests and 
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the acd r re n 
m mE of administration was randomized in order 
avoid systematic effects of fatigue. 


RESULTS 


T he subjects were rank ordered by chrono- 
logical age and divided into five groups of five 
males and five females each. A comparison by 
-= showed no significant differences in years of 
eion among the five groups. Standard 
5 nary Mental Ability (PMA) scores with 
eas of 50 and sigmas of 10 were then 
computed, by the use of Thurstone’s conver- 
sion chart, from the norms for his 17-year-old 
group. Comparable standard scores for the 
Ha and Tigidity measures were computed, 
5 ing Schaie’s tables for his younger groups, 
a 18-26. Standard scores on the Memory 
f e were computed from Wechsler’s norms 
or his 20-30-year-old group (13). 
wate 1 gives the mean standard scores for 
= of the five groups on all test variables. 
“xamination of this table indicates that, for the 
11-year-old group, the Reasoning and Spatial 
Scores are below the mean of the 17-year-old 
reference population. If it can be assumed, on 
the basis of the amount of education and of 
Previous occupational level, that the present 
ae at an earlier age would have been above 
: e mean of the general population on these 
ests, then an unspecified amount of loss in 
Teasoning and spatial abilities has occurred 
Prior to age 71. The Verbal-meaning, Number, 
and IQ scores are somewhat above the means 
for the younger reference population. Whether 
any loss has occurred on the Verbal and Num- 
ber factors cannot be determined from these 
data. The relatively low scores of the 71-year- 
olds on the Word-fluency factor is most 


TABLE 1 


MEAN T Scores FoR SUBGROUPS 


o = 10) 


=~ (Young Adult Norms, M = 50, 
S sg | | E A 
gleivej)s | Re] N w = Sej 5 |S 
1 | 71l58.5l41.9146.4157.9 “yr ols4.3/44.2)39.4146-4 
¢ 73/57 .1/37.6/44.1/55.6 REENE EE 
3 | 75/59438. 2140.6/58.1 co-ed Set Sf 
79/52.8/35.9|39.8'53.4 48.3]45.4)41 00 ola 9 
È | 8445.8133.5/32.0/48.0 48.340.533. oa 8.3 
Dis [12.9] 8-4/14.4| 9-9/0 7)13.8111.0110-4) 8- 


Key To Symbols: 


V = Verbal Meaning JQ = Intelligence Quotient 


S = Space Sp = Speed 
R = Reasoning Mem = Memory 
N = Number M-Rig = Motor Rigidity 


W = Word Fluency 
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Fic. 1. Mean T SCORES ON PRIMARY MENTAL 
ABILITIES FOR SUCCESSIVE AGE GROUPS 


probably an artifact of sampling. The memory, 
speed, and motor-rigidity scores fall below the 
means for their respective reference groups. 
Figure 1 presents graphically the subgroup 
means for the PMA scores. For the Spatial and 
Reasoning factors, the scores show a steady 
decline after age 71. The Verbal-meaning, 
Number, and Word-fluency scores remain 
fairly high in the 71-year-old group, and then 
decline. 


Figure 2 shows the curves of decline for the 


speed, memory, and motor-rigidity factors. 
There is little difference between the scores for 
the 71- and 73-year-old groups, but beyond age 
73 the curves drop steadily. The most rapid 
loss occurs in memory. 

Statistical analysis, using the / test to 
evaluate the significance of the difference 
between the youngest and the oldest groups, 
shows a decline significant at the .05 level on 
all factors except for Word-fluency. 

The next step in the analysis was to deter- 
mine whether the decline in PMA scores would 
be affected if corrections were made for the 
effects of lowered speed, increased rigidity, and 
memory loss. The previously described in- 
dependent measures of these abilities were 
therefore correlated with the PMA scores. 
Intercorrelations among all these measures 
were computed on the IBM Type 604 elec- 
tronic calculating punch, using the standard 
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TABLE 2 
Test INTERCORRELATIONS 


70 


2 
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a 
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+ a at 
AGE 


Fic. 2. Mean T Scores FOR Memory, SPEED 
AND Motor Ricwwity 


scores method (7). These intercorrelations, 
most of which are significant, are given in 
Table 2. Formulas were derived to correct for 
the effects of these factors on the PMA scores.” 
The correction formulas provide estimates of 
those parts of the PMA scores which are 
independent of the combined influence of the 
speed, rigidity, and memory Scores, and yield 
corrected PMA measures in terms of T scores 
for the norm group. 

It was not necessary to compute individual 
corrected PMA scores for the members of the 
experimental group in order to get estimates of 
the corrected subgroup means. These could be 
computed directly from the subgroup means 
given in Table 1 and the intercorrelations in 
Table 2, but the computations required the 
calculation of the beta regression vector for 
each PMA variable on the correction variables 
and also its multiple correlation with them, 


2 Materials presenting the mathematical derivation 
of the correction formulas, computational instructions, 
and tables of intermediate computations have been 
deposited with the American Documentation Institute, 
Order Document No. 5299 from ADI Auxiliary Publi- 
cations Project, Photoduplication Service, Library of 
Congress, Washington 25, D. C., remitting in advance 
$1.75 for microfilm or $2.50 for photocopies. Make 
checks payable to Chief, Photoduplication Service, 


Library of Congress. 


s | R|N|w | Q | 8p. | Mem| M-Ris. 
Verbal Meaning .34a|.612| 3001 1721 630! 676 -375 | «501 
Space 361) .193 -093|.470 .145) .266 | .263 
Reasoning +395) .354) .757/.674] .571 | .462 
Number -287) .784].411] .318 | .325 
Word-fluency 570) .396) .452 | .025 
Composite-IQ -667| .554 | 445 
Speed 410 | .318 
Memory 406 
Motor-Rigidity | 
TABLE 3 
Beta REGRESSION WEIGHTS AND MULTIPLE 
CORRELATIONS 
v] s | R | N w |o 
Speed „571 |010 f.495 |.305 -298  |.497 
Memory «013 |.187 |.292 | .120 .429 |.280 
Motor Rigidity |.315 |184 |186 «180 |—.244 |.174 
z «899 |.381 |.973 |.605 483 |951 
R? -548686/.099584 | .586294/.222015| .305816| .564049 
R TAL |.316 |.766 [an «553 |.751 
TABLE 4 


MEAN T SCORES AFTER CORRECTIONS FOR 
THE EFFECT OF SPEED, MEMORY 
Loss AND INCREASED MOTOR 


= Riciwity — 

Mean Age V S R N w IQ 
71 69.5 44.4 54.7 63.1 53.6 66.3 
73 67.2 38.9 49.1 50.9 59.2 60.4 
75 71.5 40.1 45.1 63.3 67.4 65.5 
79 65.9 40.2 50.4 60.7 56.6 58-4 
84 64.1 39.3 47.9 58.9 61.2 60-4 
Dis 5.4 5.1 6.8 `42 —7.6 5.9 


The regression weights and multiple correla- 
Hons are given in Table 3, and the correcte 
subgroup means in Table 4. Strictly speaking, 
the correlations from which the regression 
weights and multiple correlations are derived 
should be based on a single norm group. Since 
this was not possible, it was necessary to use 
the correlations in Table 2, based on the 
experimental group. 

Even though it was not necessary to use the 
individual scores to calculate the corrected 
subgroup means for the experimental group, 
corrected PMA scores may be computed for 
any given individual by means of Table 58. 
Assume that a person’s corrected (P.) score 

° The derivation of this table is indicated in Section 


C of the material deposited with the American Docu- 
mentation Institute. See Footnote 2. 
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TABLE 5 


Weicuts USED IN CALCULATING INDIVIDUAL 
CORRECTED PMA Scores 


r|nliw |12 


| v | s | 

a) 
We | 1.480) 1.054 1.585) 1.134] 1.200) 1.514 
Ws "850, O11] -770, 1346] .358 752 
Wa | 019) iig casal 136) Sis) -424 
Wie | p.409] 194) -289| 2081-295) 263 

42.5 [1714 47.9. (27.6 [19.0 [46.3 


A desired for any given PMA score. Let his 
i scores on speed, memory, and motor rigidity 
be respectively Cs, Cm, and Cr. His Pe score 
Is given by 


P, = PWp — CsWs — CaWar — CrWr + F. 


The values for the W’s and for F are taken 
from the appropriate column of Table 5. For 
example, suppose a person’s Pe score for the 
verbal factor is desired and that his verbal 
score is 47, and his speed, motor, and rigidity 
scores are, respectively, 53, 49, and 45. Using 
the values in the first column of Table 5 we 
have 


Pe = 47(1.489) — 53(.850) 
— 49(,019) — 45(.469) + 42.5 


Curves for the corrected subgroup means are 
shown in Figure 3. When corrections are made 


= 45.4. 
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TABLE 6 


STANDARD SCORES ON THE PRIMARY MENTAL 
ABILITIES TEST FOR SUBJECTS OVER 
EIGHTY YEARS OF AGE 


A Verbal- -., Reason- Jord- 
Age meaning Spatial ing Number ie 
Females: 
80 58 33 39 39 40 
80 56 32 48 56 49 
80 56 39 42 49 38 
81 42 36 30 50 58 
82 54 35 33 58 39 
82 34 27 27 27 40 
83 50 27 27 52 41 
we 35 37 34 35 67 
Males: 
80 46 33 32 49 52 
81 49 37 36 56 47 
82 51 40 33 41 52 
82 51 38 33 60 37 
83 50 41 35 52 65 
s4 50 34 30 50 42 
85 50 27 44 61 40 
88 42 33 27 35 54 


for losses in speed and memory and for an 
increase in motor-rigidity, the age curves for 
the Primary Mental Abilities flatten out 
appreciably. The Verbal, Number, and Word- 
fluency scores remain approximately 1 SD 
above the mean of the younger age groups at 
age 84. Scores on the Spatial factor, while 
significantly below the mean for the younger 
ages, show no appreciable change over the 
age range, 73-84. 

The range of differences in Primary Mental 
Abilities scores for individuals of advanced 
age is of interest. Table 6 shows the uncor- 
rected standard scores (with a mean of 50 
and an SD of 10) on this test for the oldest 
subjects in the present group. Of the total of 
16 in this age range, the verbal meaning 
scores of ten Ss and the numerical ability 
scores of nine remain at or above the mean for 
an unselected population of 17-year-olds. 


DISCUSSION 


The retention of verbal ability shown here 
is consistent with the results of other studies. 
Owen’s data (3) and those of Bailey and Oden, 
(2), both obtained by longitudinal methods, 
indicate that some verbal abilities continue to 
increase up to age 50. Doppelt and Wallace 
(4) have recently reported normative data for 
the Wechsler Adult Intelligence Scale on a 
fairly large and representative sample of older 
persons. The Vocabulary, Arithmetic, and 
Information tests show the smallest decline up 
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to age 70. For their oldest age group, with a 
mid-point age of 79.5, the Vocabulary score is 
only approximately .5 SDs below the mean 
of the reference population. š 
The studies most directly comparable with 
the present one are those of Schaie, Rosenthal, 
and Perlman (6), and of Sward (11). In the 
Schaie study, the oldest age group ranged from 
71-78 years and the general educational level 
was considerably below that of the present 
group. On the PMA test, Schaie’s older Ss fell 
below the mean of the 17-year-old reference 
population on all factors. The lowest scores 
were obtained on the Space and Reasoning 
factors, as in the present study, and the highest 
scores were on Number and Verbal-meaning, 
as in the present case. However, the rank 
order of magnitude of these Scores was dif- 
ferent in the two studies. The earlier study 
showed the greatest loss on the Reasoning 
factor and the least on Number, while the 
present study showed the greatest loss on the 
Spatial factor and the least on Verbal-meaning. 
Sward’s study is of particular interest 
because his Ss were drawn from university 
faculties and thus were fairly closely com- 
parable to those used in the present study. His 
older group ranged in age from 60-79, with a 
mean CA of 66.2; his younger group ranged 
from 21-42, with a mean CA of 31.4. Unfor- 
tunately, from the standpoint of any direct 
comparison of his results with ours, Sward 
used a special test battery, the correlations of 
which with PMA scores are unknown. How- 
ever, on all except two of his tests the older 
group did significantly less well than the 
younger. On his “word-meaning” test, the 
older group did as well as the younger. On a 
“synonym-antonym” test, the older Ss were 
significantly better and on an “index of 
verbalization” which Sward constructed, the 
older Ss also scored higher. These results are 
consistent with those obtained in the present 
study and lend support to the conclusion that, 
at least in a population of superior intelligence, 
verbal abilities remain at a high level into the 
middle seventies. 


SUMMARY 


Scores on primary mental abilities, speed, 
memory, and motor-rigidity were obtained for 
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a group of fifty college graduates, ranging in 
age from 70-84 years. Differential decline in 
these abilities with advancing age is shown, 
with the earliest and largest losses occurring 
in memory, in speed, and in reasoning and 
spatial abilities. Word-fluency, verbal-mean- 
ing, and numerical abilities in this group of 
superior individuals remain well above the 


mean for young adults until the middle 
seventies. Verbal-meaning and numerical 
ability scores remain at or above the mean for 
the 17-year-old reference population in a 
majority of the Ss over eighty years of age. 
Since this study is cross-sectional, rather than 
longitudinal, the possibility that selective 
factors may account for the high scores of the 
older Ss cannot be ruled out, 
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JUDGMENTS OF CONSENSUS DURING GROUP DISCUSSION! 


PHIL STONE anp JOE KAMIYA 
University of Chicago 


r THE members of a group are asked to re- 
cord privately their agreement or dis- 
agreement with a given proposition and 

to estimate the proportion of the group that 
shares the same opinion, the resulting data 
can be analyzed to reveal the degree of ac- 
curacy in the members’ appraisal of their 
group’s consensus. In several investigations 
(1, 2, 3, 4, 5) this technique, or some variant 
of it, has been used on a wide variety of 
opinion and attitude areas. Such studies have 
generally shown that individuals can estimate 
the consensus of their group with some 
degree of accuracy. Presumably, the estimates 
are based at least partly on knowledge acquired 
from past contact with whatever cues to the 
group’s opinions that were available. The 
present study, with the general aim of shed- 
ding light on the manner in which the cues 
influence the estimates, undertakes a descrip- 
tive preliminary survey of judgments of group 
opinion in small discussion groups, where cues 
for the estimation of group opinion are pro- 
vided through a discussion of the issues by 


the members themselves. 
The questions explored are: Is the degree 


of estimation accuracy achieved through 
group discussion markedly different from 
that found in previous investigations where 
conditions of group discussion were not used? 
What is the nature of the errors? How are 
changes in estimates and in opinions related 


to the discussion content? Is accuracy in- 


creased by discussion? 


METHOD 


Subjects 

The subjects (Ss) were six groups of high school 
students in Evanston, Illinois. The W of each group 
varied from 14 to 16. Both sexes were approximately 
equally represented in all classes. Ninety-two per cent 
of the students were either 15 or 16 years old, the re- 


mainder being 17. 


Topics of Discussion 


In order to establish the discussion itself as the pri- 
mary source of cues for the estimates of group consen- 


1 This study is part of a project supported by a grant 
from the Social Science Research Council. 


sus, the topics presented were novel to the group. Pre- 
tests of comparable groups showed that the topics had 
probably not been discussed by the Ss before the study, 
but were of sufficient interest to maintain active dis- 
cussion. In their order of presentation, the topics were: 
(a) whether the state law should be changed so that 
some high school juniors and seniors could drive the 
school buses at their school, (b) whether bicycle riders 
should be required by law to walk their bicycles across 
all intersections, and (c) whether women (adults) 
would be suitable as school bus drivers. 


Procedure 


The Ss were seated facing each other around a circle. 

Response forms were provided each S. The experimenter 
(E) began the session for each group by saying that the 
purpose of the study was to survey their opinions on 
some topics through group discussion. The Ss were told 
to try to reach a group decision on the issues presented 
them. 
The E explained that during the discussion he would 
occasionally interrupt the discussion, and request each 
person to record on the form his opinion on the specific 
proposition then being discussed. Only “yes” or “no” 
opinions were permitted, corresponding to agreement 
or disagreement with the proposition. Immediately 
after indicating his own opinion, S was required to 
record his estimate of the number of persons in the 
group who held the same opinion as he did. Each re- 
cording of an opinion and corresponding estimate is 
hereafter referred to as an “item.” 

The first item on each main topic was administered 
before any discussion occurred. The S was asked to 
give a favorable or unfavorable (yes or no) opinion and 

the topic as worded above. 


a consensus estimate on 
Subsequent items dealt with various facets of the main 
topic, as brought out by the discussants themselves. 


These items were careful formulations by Æ of the S’s 
contributions into concrete propositions. A sample item 
from the first topic: “High school junior and senior bus 
drivers wouldn’t be able to handle the rowdies on the 
bus.” “Answer ‘yes’ if you agree; ‘no’, if you disagree, 
and estimate the number of people who marked the 
form the way you did.” 

The extent of discussion allowed between items was 
varied. At times only one S’s contribution intervened; 
at the other extreme, as many as six different Ss were 
allowed to contribute before the next item was taken. 
Among all the items taken during the discussion of a 
main topic, some pairs of items were directly compara- 
ble, as a result of an S repeating a proposition which had 
been used as a previous item. These pairs were used to 
assess changes in the opinions and the estimates. 

The Ss were informed in advance how long the dis- 


cussion on each topic would last. At the end of the 
allotted period, E gave a final summary item and di- 
opic. The total 


rected the discussion to the next main t 
time of discussion was about 45 minutes. The total 
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number of items taken in each group over all three 
major topics ranged from 18 to 26. 


RESULTS AND DISCUSSION 
Accuracy of Estimation 


Presented first are measures of estimation 
accuracy over all items without regard to the 
nature of the discussion preceding the items. 
Two different measures were used. One 
measure was the difference between the 
estimated group opinion and the actual 
group opinion. The other measure was the 
Pearson product-moment correlation between 
estimated and actual opinion. These two 
measures were used in the analysis of the 
estimates of individuals as well as the pooled 
median estimates of the group. 

Accuracy of individuals. For each S, the 
absolute difference between the estimated 
and actual group opinion was obtained on 
every item taken during the discussion period. 
The average of these differences over all 
items for a single S represented his average 
error score, in number of persons. The lowest 
average error obtained was 1.42 persons; 
the highest was 4.72 persons. The mean and 
SD of these average individual error scores 
are presented for each group in Table 1, 
Items 1 and 2. 

For the correlational accuracy score of 
individuals over all items, 7’s were computed 
between estimated and actual group opinion 
for each of 18 Ss randomly selected from the 
six groups. The 7’s ranged from .11 to .75 
with a mean of about .40, values roughly 
comparable to the mean rs of .69 and .44 
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reported by Travers (4) and Crow (2), respec- 
tively. 

Accuracy of median eslimales. For analysis 
of the estimates of the group as a whole, the 
median estimate was determined for each 
item. Since the task for each S was to estimate 
the number of Ss who shared kwis opinion, 
a transformation was made before the median 
was computed to render the estimates of 
all Ss into comparable estimates of the number 
of “‘yeses.” 

To measure group accuracy in terms of 
difference scores the absolute difference be- 
tween the median estimate and the actual 
group opinion was determined for each item. 
The mean and SD of these error scores for 
each group are shown in Items 3 and 4 of 
Table 1. The mean error, expressed as a 
percentage of group size, is shown in Item 5. 
These values are also comparable to results 
found in other studies (3, 4). 

As would be expected from the effects of 
pooling, the mean error of the estimates of 
the entire group is considerably smaller than 
the mean of the average error scores of 
single individuals shown in Item 1, However, 
further analysis indicates that the relative 
superiority of the group over the individual 
estimate is even greater than might be ex- 
pected on these grounds: of the 88 Ss tested, 
there are none with average error scores that 
were smaller than the mean error of the poole 
estimates of their group. 

In every group the correlation across all 
items between the median estimates of the 
entire group and the actual consensus was 


TABLE 1 
SUMMARY OF RESULTS 


Item O 
I ii II IV v vi 
Number of Subjects 14 15 14 14 16 15 
Number of Items 19 18 23 26 19 20i 
1. Mean of individual average error scores 2.43 2.65 2.57 2.44 2.57 E 9 
2. SD of individual average error scores 41 -68 „56 53 -80 an 
3. Mean error of group median 1.26 1.33 1.61 96 1.42 2.16 
4, SD of mean error a .84 1.00 1.00 65 90 1% 
5. Item 3 as percentage of group size 9% 9% 11% 1% 9% j 14 bo 
6. Correlation between group estimate and actual con- 95 -96 -90 95 -97 
us > 
7 Mean algebraic error of factions 42.12 +2.47 +2.21 42.52 +2.89 et 
8. SD of algebraic error f 2.66 3.43 3.13 2.9% 4.27 E 
9. Mean absolute error of factions 20 4 3.03 21 aie 2.42 
e a error 5 : y t -56 x 
10. SD ol a emn group estimate and actual con- -67 -56 .64 72 57 -63 
GONE n opinion held constant . ee 
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20 or higher (see Item 6, Table 1). These 
r’s are similar to the 7’s of .82 and .93 found 
by Campbell (1) and Travers (4). 

A ccuracy and assumed similarity. Previous 
studies (1, 4, 5) have shown that Ss’ estimates 
of consensus are correlated with their own 
opinions. That is, those Ss who share a given 
opinion on an issue give a larger estimate of 
the numbers holding their opinion than do 
those who hold the opposite opinion. To 
determine the extent to which this tendency 
for Ss to assume similarity between themselves 
and others occurs under conditions of open 
discussion as created in this study, the median 
estimates of those whose own opinions were 
‘yes” were computed separately from the 
median estimates of those whose opinions 
were “no.” It was found that on every one 
of the 125 items in the entire study, the median 
estimate of one side was larger than the comple- 
ment of the median estimate of the other 
side. The results of previous studies are thus 
clearly supported here. 

In terms of accuracy, the foregoing finding 
means there is a general tendency for Ss on 
each side of an issue to overestimate the 
numbers who share their opinion. The average 
net tendency toward overestimation per 
item is shown in Item 7 of Table 1. These 
figures are averages of the algebraic differences 
between the median estimate of each side and 
the actual number of Ss on that side. A 
comparison of these differences with the 
absolute differences between the same meas- 
ures (Item 9) clearly shows that the predomi- 
nant error in each group is that of overestima- 
tion. 

Consideration of the task confronting the 
Ss makes it clear that the direction and extent 
of error in estimation can not be uniform 
for all sizes of “factions” (those who share 
one opinion). If all group members hold 
the same opinion, the only error of estima- 
tion that can possibly occur is underestimation, 
since the highest estimate that can be given 
is limited to the total number of persons in the 
group. At the other extreme, if only one 
member holds a given opinion, he can err 
only in the direction of overestimation. 

If such constraining effects of group size 
were the only factors influencing the errors 
in the estimates, equal but opposite errors in 
the median estimates of the two extremes 
of faction size would result. However, as 


could be predicted from the observed wide- 
spread tendency to overestimate, such sym- 
metry in the errors did not occur in the 
data. Analysis showed that factions comprised 
of about 10% of the group gave median esti- 
mates that averaged about 55-60% of the 
group size, while factions comprised of 90% 
of the group gave median estimates that 
were about 80% of the group size. The esti- 
mates of the intermediate sized factions fell 
approximately on a straight line between 
these two extremes, with greatest accuracy of 
estimation being found in factions comprised 
of approximately 75% of the group. Hence, 
it is clear that all but the largest factions 
tended to overestimate their size. 

The fact that all factions, regardless of 
size, assumed on the average that more 
than half the group shared their own opinion 
(or that less than half of the group held 
the opposite opinion) accounts in part for 
the high degree of accuracy of the median 
estimate of the entire group, since a majority 
of the estimates on which the group median 
is based comes from the majority side, and 
the pooled estimate of the entire group is 
therefore partly determined by the actual 
consensus. The accuracy of the pooled median 
estimate is thus spuriously high when taken 
as an index of the judgmental achievement 
of the group. As an empirical fact, however, 
the high accuracy of median estimates should 
lend itself to the use of estimates for the 
indirect measurement of attitudes in small 
groups. 

An attempt was made to determine whether 
any correlational accuracy would remain if 
the factor of assumed similarity were excluded. 
The S’s own opinion was held constant by 
computing the accuracy correlations of the 
“yes” factions and the “no” factions separately 
across all items, and then averaging the two 
correlations. The results, shown in Item 11 of 
the table, indicate appreciable judgmental 
accuracy apart from the spurious effects of 
assumed similarity, with 7 ranging from .56 
to .67 among the six groups. 


The Influence of Group Discussion on Estimates 
and Opinions 
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estimates and opinions. All six groups were 
combined in this analysis. The changes were 
assessed by comparing the estimates and 
opinions taken on an item before discussion 
with those taken on a comparable item after 
discussion. From the six groups, a total of 
49 comparable before-and-after item pairs 
was found. 

Overall favorableness or unfavorableness 
toward the propositions presented of the 
views expressed in discussion was rated on a 
seven-point scale by two judges using verbatim 
transcripts. All relevant expressions of opinion 
intervening between the comparable items 
were rated as a single unit. The judges based 
their ratings on the number of different 
Ss speaking as well as the content of each 
S’s speech. The inter-judge correlation over 
the 49 discussion units was .82. The average 
rating of the two judges on each item was 
transformed into a percentage according to 
its relative location within the range of all 
the ratings, in order to provide an index 
of the favorability of discussion that could 
be compared directly with the degree of 
favorableness of the group opinion measures. 

Changes in estimates. Since the views 
expressed in support of a proposition (or in 
opposition to it) during discussion provide 
cues for the estimation of consensus, it is 
reasonable to expect the median estimate of 
the group to change in the direction of the 
expressed support, unless the expressed support 
is representative of the initial estimate. The 
expressed support was considered representa- 
tive if it did not differ from the initial estimate 
by more than 10% of group size. Eliminating 
these representative cases, the median estimate 
shifted toward the level of expressed support in 
35 out of 41 cases (p < 001.)° . . 

Changes in actual opinion. Since the views 


2 For assessing the significance of this proportion, 
the value for the chance proportion under the null 
hypothesis was determined empirically, following the 
method used by Crow (2) for a somewhat different 
problem. The relative frequency with which the ob- 
tained changes occur in the expected direction was de- 
termined, using level of expressed support in every 
other discussion unit in the study asa base of prediction. 
This gave a total of 49 X 48, or 2352, comparisons. The 
roportion of cases in which the estimates shifted in 
p! Pi ction of these other ratings of discussion units 
cg This value was taken as b in the binomial 
distribution, against which the obtained proportion of 


35/41 was evaluated. 
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expressed during discussion may affect opinions 
as well as estimates, a parallel analysis was 
undertaken of the changes in the opinions. 
The results were again clear cut. In 32 out of 
41 cases, the actual group opinion shifted in 
the direction of the expressed support. This 
obtained proportion, evaluated against its 
chance level of .53 (obtained by the same 
procedure as for the estimates) is significant 
at the .001 level. 

Other findings. The correlation between 
changes in the median estimates of the group 
and the changes in actual group consensus 
was found to be .89, significant at the .001 level. 

It might be expected that the estimates 
would shift more in the direction of the 
expressed support than would the opinions, 
on the ground that the Ss are likely to think 
others are more affected by what is said 
than they are themselves. However, the 
results from several methods of analysis 
point to no significant difference between 
the change scores for the two measures. 

No significant tendency of error reduction 
was found. In the 49 units of discussion, 
error (expressed as the sum of the members’ 
absolute errors on an item) was reduced only 
22 times. Moreover, no tendency was found 
for error reduction to be related to the number 
of people who spoke during the discussion unit. 

Although the error is not reduced, the 
discussion might nevertheless tend to produce 
a convergence of estimates toward a common 
level. A tendency was in fact found for the 
variance among the individual’s estimates to be 
reduced by discussion (p = .08). 

Interpretation, That accuracy was not 
increased by the discussion is understandable 
in light of the high degree of similarity observe 
in the changes of the estimated and actua 
consensus. The similarity of the changes, 
in turn, is in part accountable for by the 
previously observed relationship between t s 
Ss’ estimates and their own opinions. Given 
this relationship, changes in the Ss’ opinions 
should be accompanied by changes in their 
estimates. a 

Whatever the reasons may be, it is appare”? i 
that not only do the expressions of opinio? 
influence the estimates; they influence the 
group consensus itself, and in substantially 
the same manner. The types of conditions 
upon which this dual effect depends needs t° 
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be explored. The findings of the present study 
suggest that rather complex interactions may 
be involved among expressions of opinion, 
group consensus, and judgments of consensus, 
and that the common conception of group 
discussion as a means of increasing the under- 
standing of the group members’ views needs 
qualification. 


SUMMARY 


Six groups of high school students held 
discussions during which the individual’s 
opinions and estimates of the group consensus 
were privately taken on the issues considered. 

The degree of estimation accuracy achieved 
was not noticeably different from the accuracy 
found by other investigators under conditions 
of no discussion. Tendencies among Ss to 
overestimate the number of group members 
who share their opinion similar to ones 
previously reported, were also found. Even 
minorities comprised of only about 10% of 
the group tended to estimate that more than 
half the group shared their opinion. 

The views expressed by the Ss during dis- 
cussion influenced both the median estimate 
of consensus and the consensus itself. The 
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influence on both was highly similar in that 
(a) the amount of change in the actual con- 
sensus was correlated with the amount of 
change in the estimated consensus, (6) both 
the estimated and actual consensus tended to 
change in the direction of the views expressed, 
and (c) the amount of change in the estimated 
consensus was not significantly greater than 
the amount of change in the actual consensus. 

Over the course of discussion, the estimates 
did not tend to become more accurate. There 
was a slight tendency toward reduction in 
the variance of estimates among the Ss. 
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University of 


HE predictions of an observer about the 
behavior of another person are a product 
| of the observer’s hypotheses or concep- 
tualization regarding the particular individual. 
The accuracy of his predictions depends upon 
the validity of the conceptualization. His hy- 
potheses are, in turn, a function of the infor- 
mation available about the person to be judged, 
and of the judge’s typical way of categorizing 
information or forming concepts about others. 
These assumptions were developed into three 
hypotheses which the present study was 
designed to test. 

A more accurate impression or conceptualiza- 
tion of another person should be possible when 
more information is given the observer about 
the individual, and a more accurate prediction 
of his behavior should therefore result. 
Hypothesis 1. The more information provided 
judges aboul a person, the more accurate are 
their predictions of his behavior. 

The judge’s typical concepts or ways of 
grouping others determine the kinds of 
discriminations and consequent predictions he 
makes about them. Judges with a larger 
number of concepts for grouping others 
(hereafter called Complex Judges) should 
develop more complete and more accurate 
impressions of an individual than those who 
use few concepts (hereafter called Simple 
Judges). Hypothesis 2. Complex Judges more 
accurately predict behavior than Simple Judges. 

Since one function of information is to 
provide the setting for fine or multifactor 
discriminations, one would predict that the 
impressions developed by Complex Judges 

would be more complete and representative ofa 
particular person when they are given ad- 
ditional information. Simple Judges, however, 
should continue to classify data about the 
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person observed into the few categories avail- 
able to them and their impressions should 
not increase in accuracy with amount of 
information provided. Hypothesis 3. With in- 
creasing information, Complex: Judges show a 
relatively greater increase in predictive accuracy 
than Simple Judges. 


METHOD 


The design required a pool of subjects to 
serve as judges and interviewees, who varied 
in respect to simplicity or complexity of 
interpersonal concepts, and in the amount of 
information received. Judges were given infor- 
mation about another person by means of 
tape-recorded interviews, and then they pre- 
dicted the other person’s responses to a 
questionnaire. 


Selection of Judges and Interviewees 


Bieri’s modification of the Kelly Role Repertory 
Test (1, 3) was used to rate subjects for conceptua 
simplicity or complexity. Subjects (Ss) were given a 
list of 12 different categories of people (e.g., a person 
you’d like to know better; the most ethical person you 
know, etc.). The S named an acquaintance or relative 
to whom each description could be applied. He com- 
pared selected triads of these 12 individuals, and fa 
each triad he was instructed to provide a personal Gam 
acteristic or trait in regard to which two of the person 
were similar to one another but different from the m f 
Any trait or psychological concept could be used ik 
the differentiation. The S then checked the two like 
individuals. His next task was to see if the concept n 
plied to any of the other nine individuals not in fa 
original comparison, and to check any to whom We 
concept applied. The Ss made twelve comparisons, 
using a different specified triad for each. i from 

Scorable protocols on this test were obtained i 
253 male students in elementary psychology, and E 
for overlap or simplicity in use of concepts. Ke ais 
extent that the subject checked the same individ ae 
as examples of the constructs he had formed in the en 
cessive comparisons of triads, he obtained a high sc 2, 

“on simplicity. Scores obtained ranged from 0 to è 
with a distribution skewed toward high, or Lee 
scores. Subjects scoring 20 or above were classified ten 
simple, and those scoring between two and six as Pia 
plex. From these extreme groups, 14 Ss were se p 
as interviewees (7 simple and 7 complex), and 5 
judges (28 simple and 28 complex). , a 

To determine whether simplicity-complexity a 
related to intelligence, the A.C.E. scores for a randon 
selected sample of these Ss, 95 in all, were correla 
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with their simplicity-complexity scores. The obtained 
correlation (r = —.04) was not significantly different 
from zero. 


Information Provided 


The information from which judges were to make 
their predictions was obtained by tape recording in- 
terviews with seven simple and seven complex indi- 
viduals, Four content areas were covered in the inter- 
view: (a) family background—description of parents 
and siblings; (b) school and job experiences—descrip- 
tions of teachers and bosses as well as relevant activities; 
(c) values—description of girl friend, and ideal mate, 
future plans; (d) self-description—how others sce self, 
desires for personal change, and attitudes toward others. 
The information from topics a and b above, about 15 
minutes of the tape, constituted partial information, 
while the entire interview of about 35 minutes consti- 
tuted the condition of complete information. 


The Prediction Questionnaire 


g Upon the close of the interview, the interviewee gave 
his answers to a 38-item questionnaire which then 
served as the criterion for the accuracy of the judges’ 
predictions. 

This questionnaire consisted of 38 four-choice items 
concerning behavior in social, work, and school situa- 
tions. The questionnaire was administered in the ele- 
mentary psychology classes to determine the distri- 
bution of responses to the items under conditions of 
self-description. Completed questionnaires were ob- 
tained from 236 males in addition to the 56 who later 
Served as judges. 

The distribution of responses for the group of 236 
showed definite stereotypy of self-description, indi- 
cating that the chance level of accuracy would not be 
the same for all item alternatives. In order to compen- 
sate for the advantage this stereotypy would otherwise 
have given a judge who relied on group norms rather 
than on the interview information for his predictions, 
a weighting formula was developed which gave greater 
weight to accurate predictions the more the inter- 
viewee’s response deviated from the group stereotype 
and the more the judge’s prediction deviated from the 
use of “normative concepts.” The proportion of times 
ed each alternative was tabulated for 
d weights to be applied to correct pre- 
ternative were derived according to 
the following formula: one minus the proportion of 
self choices divided by the proportion of times the 
choice is predicted by judges. As the proportion of 
times an alternative is used for self-description or pre- 
diction increases, the weight decreases. The use of the 
formula was intended to give greater weight to pre- 
dictions that reflected discrimination of qualities char- 


acteristic of the individual interviewee. 


judges predict 
every item, an 
dictions of an al 


Design and Procedure 
i imple 

i designed so that both Simp 

+O ee a simple and a complex 


and Complex Judges judged d p 
biecdtewte, with partial and complete eerie 
and with each of these combinations of varia a 2 
first and second position. The design (Table 1) ca 


TABLE 1 
EXPERIMENTAL DESIGN SHOWING TYPE OF 
INTERVIEWEE, AMOUNT OF INFORMATION, 
AND SEQUENCE OF JUDGING FOR SIMPLE 
AND COMPLEX JUDGE PARS 


Judge Pairs 
1 2 3 4 
+> 

Order Simple Simple Simple Simple 
Judged Complex Complex Complex Complex 
First 

Amt. of Inf. 15 min. 35 min. 15 min. 35 min. 

Interviewee simple simple complex complex 
Second 

Amt. of Inf. 35 min. 15 min. 35 min. 15 min. 

Interviewee complex complex simple simple 


for four pairs of judges, each pair composed of a Simple 
and Complex Judge, and one pair of interviewees, a 
Simple and a Complex. This design was replicated 7 
times with a total of 56 judges and 14 interviewees 


participating. 

Two judges, one Simple and one Complex, were 
scheduled for each of the judging sessions. Before listen- 
ing to the recorded interviews, judges studied the con- 
tent of the questionnaire items. They were then in- 
structed to listen to each of two interviews, and to 
formulate as accurate an impression of the interviewees 
as possible. After hearing each recording, the judges 
answered the questionnaire in the way they thought 
the interviewee had. An incentive award of two dollars 
was offered to the judge who was most accurate for one 
interviewee, and an eight dollar prize if he was most 
accurate for both. 


RESULTS AND Discussion 


Hypothesis 1, Predictions made with com- 
plete information were more accurate than 
those made with partial information (p < .05), 
when unweighted scores are considered. This 
effect was not significant for the weighted 
accuracy scores. Perhaps complete information 
provided the judges with a clearer idea of the 
interviewee’s subgroup membership, rather 
than of his idiosyncratic characteristics. 

Hypothesis 2. There was no difference be- 
tween Simple and Complex Judges in the 
total number of correct predictions. However, 
when compared on weighted accuracy, Com- 
plex Judges showed a tendency to be more 
accurate (p < .10). This trend is consistent 
with the interpretation that Complex Judges 
are more likely to base their predictions on 
the characteristics of a particular interviewee. 

Hypothesis 3. A comparison of the changes 
in unweighted accuracy scores revealed that 
Simple Judges showed relatively greater 
improvement than Complex Judges when 


TABLE 2 


ANALYSIS OF VARIANCE, UNWEIGHTED 
Accuracy Scores 


df Variance F 


Between Judge Source 


Between Subject Pairs 6 52.31 3.76*** 
Judge Type 1 -08 à 
Information X Order 1 2.01 
Information X Sub- 1 6.51 
ject Type 
Order X Subject Type 1 16.51 
J. Type X Subj. Pairs 6 13.00 
J. Type X Inf. X Order 1 8.58 
J. Type X Inf. X S. 1 13.58 
‘ype 
J. Type X Order X S. 1 1.08 
Type 
Residual Between 36 13.91 
Within Judge Source 
Subject Type 1 40.08 
Subj. Type X Subj. 6 31.87 4.60*** 
Pairs 
Information 1 33.22 4.80** 
Order 1 25.08 3.62* 
J. Type X Inf. 1 19.72 2.85* 
J. Type X Order 1 33.22 4.92** 
J. Type X S. Type 1 -12 
Inf. X Order X S. Type 1 8.58 
Residual Within 43 6.92 
Residual Bet./Residual 36/43 2.01** 
Within 


* 10 >P > 05. 
** 05> P > 01. 
ee 01> P > 001. 


judging with complete as compared with 
partial information (p < .10). The increase 
in unweighted accuracy with complete infor- 
mation was, therefore, largely due to increased 
accuracy by Simple Judges. 

These data raise two questions: (a) How 
did Simple and Complex Judges differ in the 
process by which they arrived at their pre- 
dictions, and (6) What role did perception of 
individual interviewee characteristics play in 
prediction? How can we understand the 
trends for Complex Judges to predict more 
atypical items and for Simple Judges to im- 
prove more with complete information? 
Could these differences in accuracy have been 
a function of actual similarity between the 
judges and the interviewees in their own 
responses to the questionaire? Analysis of 
similarity in self-response indicated no signifi- 
cant differences in judge-interviewee simi- 
larity for any experimental condition. ZUN 

Since Bieri (1) found greater ‘assimilative 
iection,” ie., prediction of similarity, in 
ra ; it is possible that prediction of 
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Simple Judges in the present experiment. An 
analysis showed significantly greater prediction 
of similarity between interviewee’s and judge’s 
responses by Simple Judges than by Complex 
Judges (p < .05). Therefore, the components 
of accuracy (the proportion of actual response 
similarities accurately predicted, and the pro- 
portion of actual response differences accurate- 
ly predicted) were analyzed. The trends showed 
that Simple Judges correctly predicted a great- 
er proportion of similarities and the Complex 
Judges correctly predicted a greater proportion 
of response differences. However, neither trend 
was significant. 

Though Simple Judges predicted signifi- 
cantly greater similarity than did Complex 
Judges, Complex Judges were more able to 
order the interviewees in terms of similarity 
to themselves. Correlations of actual similarity 
with amount of similarity predicted by the 
judge were computed separately for each 
combination of experimental conditions and £ 
scores averaged. The correlations were, r = .49 


TABLE 3 


ANALYSIS OF VARIANCE, WEIGHTED 
Accuracy SCORES 


df Variance F 


en 
Between Judge Source +* 
Between Subject Pairs 6 260.34 2.66, 
Judge Type 1 375.22 3.8 
Information X Order 1 82.05 
Inf. X Subj. Type il 12.04 
Order X Subj. Type 1 = 30.93 $ 
J. Type X Subj. Pairs 6 122.24 1.25 
J. Type X Inf. X 1 85.16 
Order 
J. Type X Inf. X S. 1 4.58 
‘ype 
J. Type X Order X S. 1 40.87 
ype 
Residual Between 36 97.77 
Within Judge Source 
Subject Type 1 175.20 oot 
Subj. Type X Subj. 6 406.87 4. 
Pairs 14 
Information 1 188.45 2. gt 
Order 1 385.32 4.3 
J. Type X Informa- 1 18.14 
tion 4.907" 
J. Type X Order 1 438.06 : 
J. Type X Subj. Type af 7.22 
Inf. X Order X Subj. 1 6.75 
Type 
Residual Within 43 88.06 11 
Residual Bet./Resid- 36/43 1. 
ual Within = 


*10>P> .05. 
** 05> P> .01. 
*** 01> P> .001. 
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(p < .01) for Complex Judges, and r = .24 
(not significantly different from zero) for 
Simple Judges. These results support the 
possibility that Complex Judges predicted on 
the basis of an impression of the individual 
interviewee that was generated from the in- 
formation provided him. Although this inter- 
pretation is in agreement with the finding that 
Complex Judges were more accurate on 
individually relevant items, the data are more 
Suggestive than conclusive. 

Observing judge performance as a function 
of the other independent variables may further 
clarify the nature of the prediction process as 
well as differences between the Judge types. 
While the increase in accuracy with full in- 
formation was greater for Simple Judges than 
for Complex, both types of judges showed 
greater accuracy under this condition. The 
increase in accuracy cannot be attributed to 
predictions of similarity since predictions of 
similarity decreased significantly (6 < .05) 
with complete information. Complete infor- 
mation also resulted in a decrease in the 
proportion of real differences incorrectly pre- 
dicted as similarities (p < .01), and an increase 
in the proportion of correctly predicted 
differences (p < .001). These results may bea 
function of the content of the information as 
well as of the amount of information. Complete 
information alone disclosed the interviewees’ 
self-perceptions to the judges. A possible 
reason for complete information being more 
helpful to Simple and Complex Judges is that 
the material on the interviewees’ self-percep- 
tions may have been very relevant for cor- 
recting the kind of errors the Simple Judges 
were making, i.e. not differentiating the 
interviewees from themselves. On the other 
hand, Complex Judges, when making pre- 
dictions with complete information, may have 
over-differentiated between themselves and 
others, Over-differentiating the interviewees 1s 
likely to result in error, since people tend to 
respond similarly to the questionnaire. Trends 
in the data showed that with complete infor- 
mation, Complex Judges decreased is. 
accurate predictions of similarity, and In- 
creased their inaccurate predictions of differ- 
ences. Perhaps Complex Judges would have 
been more accurate if they knew more about 
the normative responses to the items, or if the 
Criterion had recorded more individualistic 


responses. 
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TABLE 4 


ANALYSIS OF VARIANCE, PREDICTED SIMILARITY 
BETWEEN JUDGE AND INTERVIEWEE 


df Variance F 


Between Judge Source 


Between Subject Pairs 6 38.85 

Judge Type 1 150.89 4,14** 

Information X Order 1 2.89 

Inf. X Subj. Type 1 48.89 

Order X Subj. Type 1 11.57 

J. Type X Subj. Pairs 6 28.27 

J- Type X Inf. X 1 14.29 
Order 

J. Type X Inf. X 1 137.29 3.77* 
Subj. Type 

J. Type X Order X 1 22.32 
Subj. Type 

Residual Between 36 36.44 

Within Judge Source 

Subject Type 1 6.04 

Subj. Type X Subj. 6 23.08 1.43 
Pairs 

Information 1 89.29 5.593" 

Order 1 54.32 3:36" 

J. Type X Inf. 1 26.03 

J. Type X Order 1 24.14 

J. Type X Subj. Type 1 20.57 

Inf. X Order X Subj. 1 36.57 2.26 
Type 

Residual Within 43 16.15 

Residual Bet./Resid- 36/43 2.2614 


ual Within 


* 10 > P > 05. 

"05 > P> .01. 

ees 01 > P > .001. 

Judging interviews in succession produced 
effects upon accuracy and predicted similarity. 
The trends for the second interview were 
toward decreased accuracy, (p < .10 for 
unweighted accuracy and p < .05 for weighted 
accuracy) and greater prediction of similarity 
(p < .10). The decrement in accuracy was 
largely confined to Complex Judges (p < .05 
for the Judge Type X Order interaction on 
both accuracy measures). Simple Judges did 
about equally well with interviews judged in 
either position. The deterioration of prediction 
among Complex Judges may well reflect their 
inability to continue to respond to the individ- 
ual interviewee’s characteristics. If a Complex 
Judge tried in his predictions to distinguish the 
second interviewee from both himself and the 
first interviewee, the alternatives remaining for 
his choice were likely to be those least fre- 
quently selected for self-description. 

In general, it appeared that Complex Judges 
responded in terms of differences between 
people, while Simple Judges responded in 
terms of similarities. Viewing people as similar 
to oneself as an initial hypothesis may help 
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Simple Judges to understand and adjust to 
their own interpersonal environments. They 
differentiate only when sufficient information 
is available. Complex Judges, on the other 
hand, seem to seek a unique characterization 
of a person and to be fairly able to pick out 
aspects of his individuality even with little 
information. It appears that the general tenor 
of the experimental instructions, which im- 
plied judging the uniqueness or differences 
between individuals, had a stronger effect 
upon the Complex Judges. The Simple Judges 
of the present study resemble Lundy’s (5) Ss 
who were instructed to attend to their own 
reactions while evaluating another person, and 
Fiedler’s (2) Ss who rated high in assumed 
similarity between opposites. 

Do we have data that can tell us whether 
the judges actually perceived and responded 
in terms of characteristics of the individual 
interviewees, as well as responding in terms of 
their expectations of how college males would 
answer a questionnaire? The changes in accu- 
racy and predicted similarity with information 
give support to the view that information as 
well as expectations determined the predic- 
tions. Had data on the effects of information 
and order not been obtained, our conclusions 
might have been different. We would have 
seen that accuracy correlated with similarity, 
and that variables such as the proportion of 
similarities and proportion of differences cor- 
rectly predicted did not correlate, and might 
have concluded that information specific to an 
individual interviewee was not significant for 
accurate prediction. 

The results suggest that research directed 
toward explaining the prediction process 
should include (a) more specific treatment of 
judges’ self-constructs or the ways in which 
they compare themselves to others, (b) manip- 
ulation of the similarity between judge and 
judged as well as control of the similarity of 
their responses to group norms, (c) more 
careful control of variations in the content of 

information given to judges, and (d) systematic 


variation of the relevance of the content of 


the information to t 
tions. 


he content of the predic- 
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experimentally. A model treating social per- 
ception as an example of concept formation 
suggested the combined manipulation of 
stimulus and individual difference variables. 
It was expected that judges possessing a num- 
ber of differentiable concepts regarding others 
(Complex Judges) would be better predictors 
than judges having undifferentiated inter- 
personal constructs (Simple Judges). It was 
hypothesized that given more information 
about a person, judges would predict better 
and that Complex Judges would best utilize 
full information. Scores on the Kelly Role 
Repertory test defined the dimension of 
conceptual simplicity-complexity. Information 
was presented by means of recorded interviews, 
each type of judge judging each type of inter- 
viewee, and judgments were made on the basis 
of two different amounts of information. The 
results indicated that though Simple and Com- 
plex Judges were about equal in predictive 
accuracy, they differed in the ways in which 
this accuracy was achieved. Simple Judges 
tended to assume similarity between them- 
selves and others, while Complex Judges 
tended to differentiate between themselves and 
others. The increase in accuracy resulting 
from the provision of more complete informa- 
tion also seemed to indicate that the judges 
responded in terms of the individual charac- 
teristics of the interviewees, as well as in terms 
of group norms. 
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CONFORMING BEHAVIOR UNDER TWO CONDITIONS 
OF CONFLICT IN THE COGNITIVE FIELD! 


FRANZ SAMELSON? 
Mental Health Research Institute, University of Michigan 


NE of the oldest topics in experimental 
social psychology is the effect of a 
. communication from other persons on 
an individual’s opinion or judgment: suggest- 
ibility and conformity. While the early experi- 
ments simply demonstrated this effect in 
various situations, later ones have shown that 
the amount of conformity obtained is related 
to three clusters of factors: (a) the task stimu- 
lus, its difficulty and ambiguity (5, 14); (b) 
the Social influence, its source, content, and 
discrepancy with either the true facts or the 
individual’s initial judgment (2, 9); and (c) 
characteristics of the individual, permanent or 
temporarily induced (6, 12). While the phe- 
nomenon is well enough established—in fact, 
seems to be one of the cornerstones of social 
psychology—the interpretation of the change 
Processes, their “rationality” or “irration- 
ality,” has been the subject of controversy 
(2, ch. 14). 
In an attempt to clarify the issue, the 
distinction between two types of forces, viz., 
“normative” and “informational” social in- 


fluences, has been suggested recently by 
Deutsch and Gerard (7), thus linking the topic 
with the theory of reference groups (11). The 
importance of normative social influences for 
conforming behavior is shown in the experi- 
ment by Deutsch and Gerard, and a similar 
one by Mouton et al. (16), in which anonymity 
of the subjects (Ss) reduced the amount of 
conformity obtained significantly, but by no 
means completely. 

Informational social influences, or the crea- 
tion of a frame of reference, as in Sherif’s clas- 
sical study with the autokinetic phenomenon 
(21), seemed to be especially important in the 
realm of “social reality,” i.e., in regard to 


1 This paper is based on a doctoral dissertation sub- 
mitted to the Psychology Department of the University 
of Michigan. The writer wishes to express his grati- 
tude to Daniel Katz, chairman of the doctoral commit- 
tee, and its members, especially Theodore M. Newcomb 
and Russell A. Clark. The study was carried out under 
a research training fellowship granted by the Social 
Science Research Council. 

2 Now at Kansas State College. 


issues which are not subject to direct observa- 
tion or independent test. Yet Asch’s famous 
experiment on conformity (2, ch. 16) and its 
subsequent modifications (v.s.) seem to indicate 
that even in simple perceptual tasks (including 
immediate memory) wrong information, con- 
tained in other Ss’ responses, had some power 
to change the judgments of the naive observer. 
The implication seems to be that even simple 
perceptions may need some “consensual 
validation.” This point touches on the old 
philosophical problem of “reality” and knowl- 
edge about reality, and the suggestion given 
is quite similar to the philosophical position? 
that sense experience is transformed into 
experience of reality “out there” of material 
objects only after its corroboration by other 
observers, or after repeated independent 
observations by the individual, especially in 
the form of what Allport (1) has called deno- 
tational procedures. (Typically, this latter 
course of action is made impossible for the Ss 
in conformity and suggestion experiments—a 
restriction which may seriously limit the 
generality of the results.) In other words, 
viewing perceptions as hypotheses (4), one 
may expect that even after their initial for- 
mulation they may still be susceptible to 
confirmation or infirmation from reports of 
other observers. 

If these premises are accepted and the 
discussion restricted to the informational 
component of the influence attempt, the 
following analysis of Asch’s experiment seems 
plausible. The task stimuli (lines to be com- 
pared) were by themselves quite free from 
ambiguity. Yet this clearness of the perception 
combined with the similarly unequivocal but 
contradictory evaluation by the other ob- 
servers to create an unstable total situation 
for the naive S. He was, in fact, required to 
choose between two mutually exclusive mes- 
sages, yet had no independent basis to discount 


3 For a recent discussion, see the article by Fritz (8) 
and its distinction between S-statements (statements 
about sense-data) and M-statements (about material 


objects). 
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either of them. Since at the outset the task 
seemed quite simple, the initial expectation of 
observer agreement was presumably very high. 
The surprising discrepancy between the S’s 
perceptions and those of a number of other 
Ss, unanimous among themselves, created a 
conflict in the cognitive field which could not 
be resolved in a satisfactory manner by simply 
denying or disregarding the socially trans- 
mitted information. Instead, Ss tried to 
account for the disagreement by means of 
additional hypotheses which, because of their 
ad hoc nature, left the Ss rather disturbed and 
dissatisfied whether they had yielded or re- 
mained independent. 

On the basis of these considerations, the 
following hypothesis was proposed: The 
amount of conformity in situations of the type 
studied by Asch is reduced if Ss are given 
secondary information which, if utilized, 
reconciles the contradiction between their own 
perception of the task stimuli and the majority 
judgments, and thus enables them to account 
for the disagreement. This hypothesis is some- 
what similar to the (untested) Hypothesis VI 
of Deutsch and Gerard: “The more uncertain 
the individual is about the correctness of the 
judgment of others, the less likely he is to be 
susceptible to informational social influence in 


making his judgment” (7, p. 630). 
METHOD’ 


In order to test this hypothesis, two condi- 
tions were designed under which Ss found 
themselves in disagreement with a secretly 
instructed majority. Their task was to identify 
simple visual stimuli that were presented 
tachistoscopically but above threshold dura- 
tion. This method of presentation, rather than 
Asch’s original procedure, was used since it 
the intended manipulations more 


made k 
plausible. In the control, or Full Conflict (F), 
condition, Ss were put—as were Asch’s Ss— 


into the conflict situation without any addi- 
tional information. In the Reduced Conflict 
(R) condition, the Ss were first given the 
ore detailed description of procedures and 


4 For a m 


19). 
a t ae of Asch’s procedures have 


been used effectively by others, eg., Olmstead and 
Blake (18). 

6 These tit 
amounts of va 
forming responses. 


les of conditions refer to the predicted 
cillation between independent and con- 
Continued independent behavior, 
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opportunity to infer that their recognition 
thresholds were lower than those of the other 
participants. Subsequently, they found them- 
selves in the same conflict with the majority 
as the control Ss. 


Subjects 


Fifty beginning psychology students, all male fresh- 
men and sophomores at the University of Michigan, 
were used as Ss. They were assigned at random to either 
of the two conditions. This prior assignment, together 
with scheduling difficulties, eventually left the two 
groups slightly imbalanced (Vr = 26, Nr = 24). 


Task 


Each of these Ss met with four other Ss, who were 
in fact secretly instructed accomplices of the Æ, for 
an experiment “on the span of apprehension.” Their 
task was to recognize a four-consonant nonsense syl- 
lable presented at .2 sec., and to select it from four al- 
ternative syllables presented approximately 1 sec. 
afterwards, and left on the screen till all Ss had re- 
sponded. The three distractors had been constructed 
in such a way that one letter and/or its position was 
changed. For example, for the stimulus syllable FDMR, 
the wrong alternatives were: PDMR, PDRM, FDRM. 
All five participants announced their choice publicly in 
a fixed sequence by calling out the number of the al- 
ternative they believed to be correct. The naive S was 
always fourth to report. Twenty trials with different 
syllables were given. On twelve, all four accomplices 
gave the same wrong answer. On the remaining ©8 t 
trials, spaced in an irregular order over the experiment, 
they chose unanimously the correct alternative. The 
sequence of critical and neutral trials is indicated in 
Figure 1. 


Manipulation of Conditions 


The experimental treatment, designed to provide = 
in the R condition with information potentially rect 
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Fic. 1. PROPORTION OF CONFORMING RESPONSES ON 
Successive CRITICAL TRIALS IN Two 
EXPERIMENTAL CONDITIONS 


observers. In the Full Conflict condition, no such prac- 
tice was given. The objective informational as well as 
normative social influence toward conformity, i.e., 
the majority’s agreement on wrong responses in the 
test trials, remained identical over both conditions. 


Apparatus and Setting 

A three-hundred watt projector with a Keystone 
flashmeter attachment was used for presentation of the 
stimuli. These consisted of four capital letters typed on 
cellophane and mounted for projection. The distance 
from screen to projector was 24’, to the Ss about 19’. 
The Ss sat in a row, separated from each other by small 
screens in order to reduce the demands on the accom- 
plices’ acting skill. The back half of the room was illumi- 
nated in order to permit Ss to write during the session, 
as well as to avoid excessive contrast in illumination 


level. 


Additional Data 

After each judgment, Ss indicated their uncertainty 
about the correct response on a five-point rating scale 
from “absolutely certain” to “probably wrong.” One 
of the objectives of this measure was to check, by an 
intra-individual comparison, whether Ss would differ- 
entiate in their ratings between neutral trials and trials 
on which they had conformed. Since outwardly the 
response pattern of the group would be identical in 
both cases, higher uncertainty ratings on the conform- 
ity trials would be expected if the Ss had some aware- 
ness of conforming to wrong majority responses. Such 
a result would give support to the assumption that Ss 
did not conform simply because they could not recog- 
nize the visual stimuli. , 

Immediately after the last trial each S was given a 
Questionnaire containing three rating scales and two 
Open-ended questions. The first two ratings dealt with 
the difficulty of the task and the S’s visual acuity. On 
the third scale he was asked to indicate a judgment of 
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his ability for a task of the type just performed. This 
question was designed to test whether the experimental 
manipulations had been perceived in the intended way. 
The remaining two items were a question asking 
whether the S had been influenced by someone else’s 
responses, and a conditional question about his expla- 
nation of the disagreement with other Ss “in case such 
disagreements had occurred.” 

This task was followed by oral questioning of the 
naive S under the pretext of a group discussion of the 
experiment. The topics covered in this interview were 
the S’s awareness of the disagreement with the majority, 
his feelings about who had been right, and his way of 
accounting for the discrepancy, gradually leading to 
the question whether he had developed any suspicions 
about the behavior of the majority. None of the Ss had 
been informed beforehand by rumor, and none had 
surmised the extent of the deception during the experi- 
ment, although a few had begun to suspect that the 
experiment might not be quite what it purported to be. 
Finally, the purpose and procedures were fully explained 
to every S since all of them showed some signs of dis- 
turbance about the puzzling course of events. 


Personality Data 


Some weeks earlier, in the context of another experi- 
ment, measures of the Ss’ Achievement and Affilia- 
tion motives had been obtained. These projective moti- 
vation measures are based on stories written in response 
to TAT pictures and scored according to conventions 
developed by McClelland et al. for need Achievement 
(15), and by Atkinson ef al. for need Affiliation (3). 
Need Achievement scores are presumably indices of 
the strength of motivation “concerned with perform- 
ance in competition with some standard of excellence” 
(15, p. 111). Need Affiliation, or motivation to be so- 
cially accepted, is conceived as “a disposition to move 
toward others to elicit positive affective responses 
from them” (3, p. 409). 

At another time, the Ss had also filled out the SA-S 
personality inventory’. This questionnaire consists of 
eighty items drawn from Cattell’s Sixteen P F Inven- 
tory and presumably measures two orthogonal personal- 
ity dimensions: anxiety and social approach or “per- 
meability.” “Individuals scoring at the permeable 
extreme tend to be sensitive to stimuli, socially oriented, 
flexible, and imaginative” (22, p. 324). 

Although these personality data had been obtained 
before the experiment, it had been impossible to evalu- 
ate them in time to match Ss in the two conditions. 
However, subsequent tests showed no significant dif- 
ferences between conditions on any personality measure 
or intelligence, as measured by scores on the American 
Council on Education test (ACE). 


RESULTS AND DISCUSSION 


Differences between conditions. The first step 
in testing the hypothesis was to ascertain that 
the experimental manipulations had been 
effective. A large difference between conditions 


1 These personality data were graciously made avail- 
able by Neil A. Carrier and John W. Atkinson. 
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TABLE 1 
NUMBER OF CONFORMING RESPONSES PER SUBJECT IN Two EXPERIMENTAL CONDITIONS 
Number of C i 
Godta umber of Conforming Responses Mean 
01234567 8 9 10 ik 12 

Full Conflict —— FF 8 T § Be ae ey 2 — 26 4.7 i 
Reduced Conflict nO Ss 2 tA me gel 1 — 24 3.5 


on Ss’ postexperimental ratings of their 
ability for the task indicated that the simu- 
lated practice had indeed succeeded in struc- 
turing the situation differently for the two 
conditions. Ss in the Reduced Conflict con- 
dition, who had had the initial practice, rated 
themselves significantly more often as “above 
average” than the Full Conflict Ss (p < .001). 

Comparing the two conditions on the 
dependent variable, conformity, we find that 
the difference is in the predicted direction (see 
Table 1). However, because of the large 
variation within conditions, the distributions 
overlap each other considerably. They are also 
positively skewed. In order to test the signifi- 
cance of the mean difference, a transformation 
of the raw data to arc sin W/%, as given by 
Snedecor (23), is appropriate, since the scores 
represent not true measurements but sums of 
data in zero-one form (independence of con- 
formity per trial). The resulting ¢ of 1.81, with 
48 df, is significant beyond the .05 level (one- 
tailed). The null hypothesis of no better 
performance (i.e. less conformity) in the 
Reduced Conflict condition can be rejected 
with reasonable certainty. 

Another way of looking at the difference 
between conditions is given in an analysis by 
trials. The proportions of conforming responses 
on each critical trial are shown in Figure 1. 
On only two trials is the difference in con- 
formity between Full and Reduced Conflict 
condition negative. A simple sign test shows 
that the smaller proportion of conforming 
responses per trial occurred significantly more 
often in the Reduced than in the Full Conflict 
condition (p < .02, one-tailed). A Wilcoxon 
signed rank test increases the significance of 
the difference beyond the .01 level. 

In addition, Figure 1 shows that on the 

/ iwo trials only a few Ss in either condition 
d. The direction of the (not signifi- 
rence between the two conditions 
eversed. This result, which was 
d, helps to reject an otherwise 
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serious objection to the procedures employed. 
It indicates that the overall group difference 
is not due simply to a better adaptation of the 
Reduced Conflict Ss to the physical features 
of the task as a consequence of their prior 
practice with a similar one, Apparently, the 
effects of the majority response reached their 
real impact on the Full Conflict Ss only on the 
third critical trial, while the majority of the 
Reduced Conflict Ss was still able to withstand 
them. 

The subsequent fluctuations from trial to 
trial are probably due, in part, to differences 
between stimuli. These effects are hopelessly 
confounded in the present design with effects 
of the spacing of neutral and critical trials®. 
Yet there appears to be a general trend toward 
decreasing differences between conditions later 
in the experiment, presumably because the 
effects of the differential treatment were 
wearing off. A chi-square test of low vs. high 
conformity by condition for each half of the 
experiment results in a significant difference 
in the predicted direction (p = .025, one 
tailed) in the first half. In the second half, 
the difference is no longer significant (althous 
by a parametric test it still is). This decrease 
of the differential effect is not too surprising 
if we recall that the treatment actually con- 
sisted of a rather artificial distortion of the true 
facts. 

A comparison of conditions on the uncer- 
tainty ratings indicated a tendency towa" 
lower mean uncertainty for the ae 
Conflict condition (.15 > p > .1) failing 
reach significance. This lack of difference Þa 
been expected, since the scale points were m 
well enough defined to ensure interindividut 
comparability. J7tra-individual analysis of t d 
different response categories, however, show® 
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that most Ss in both conditions had rated their 
conforming responses on the average as more 
likely to be wrong than their correct responses 
in agreement with the majority on the neutral 
trials. This significant difference (p < .001 
with a sign test) supports the assumption 
underlying the experiment, that Ss would 
not conform simply because the visual stimuli 
by themselves were too ambiguous. No con- 
sistent differences appeared in a comparison 
of uncertainties for conforming and independ- 
ent responses. 

The remaining questionnaire responses 
showed lower ratings for the difficulty of the 
task in the Reduced Conflict condition, falling 
Short of significance (.1 > p > .05), and no 
differences in the ratings of the Ss’ visual 
acuity and the influence of the majority re- 
Sponses. In answer to this last question, about 
half the Ss in either condition insisted that 
they had not been influenced at all by the 
majority. Their insistence decreased notably 
when they became aware, during the discus- 
sion, that precisely this effect of the majority 
responses had been the focus of the Z’s at- 
tention. 

Beyond this, the interview did not produce 
any clear-cut results. Although reactions 
varied considerably, they all indicated that 
the S's understandably did not have a clear and 
consistent picture of the past happenings. 
Most of them would insist that they had been 
right whenever they had disagreed with the 
majority. But in the next sentence they 
would admit that the judgment of four people 
had a much higher chance of being correct 
than their individual one, and therefore they 
probably had been wrong. 

Personality correlates of conforming re- 
sponses. The large variation within both 
conditions indicated that an analysis of the 
personality data might be fruitful. On the 
basis of earlier results (15), need Achievement 
was expected to be negatively related to 
conformity. The meaning of the other per- 
sonality measures available—need Affiliation, 
social approach, and anxiety—Wwas not specific 
enough to warrant any predictions. — , 

While the details of this analysis and its 
tentative interpretation are reported elsewhere 
(19, 20), the major results will be briefly 
described here. A correlational analysis showed 
that, while there was no relation of conformity 
to need Affiliation and anxiety or to intelligence 
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(ACE), the other two personality measures 
interacted with the experimental treatment. 
In the Full Conflict condition, both need 
Achievement and social approach were nega- 
lively correlated with conformity (although not 
significantly). In the Reduced Conflict con- 
dition, however, both measures showed a 
positive relation to conformity, which for need 
Achievement was significant at the .05 level 
(r = + .45, N = 22). The difference between 
the two conditions with respect to the cor- 
relations with conformity was significant 
at the .05 level for need Achievement, and the 
.02 level for social approach. The fact that 
the two measures were obtained from quite 
different and experimentally independent sets 
of data (projective stories and questionnaire 
responses, given at different times) strongly 
suggests that this interaction effect is not 
spurious. 

An analysis of variance indicated that the 
interaction can entirely be attributed to the 
(expected) reduction in conforming responses 
of the medium and low need Achievement and 
social approach scores in the Reduced Con- 
flict condition. Contrary to expectations, the 
high scorers on either measure showed no 
absolute mean difference between conditions; 
but their relative position was changed from 
less to more conformity by the shift of the 
other subgroup means. 

While the over-all outcome of this analysis 
thus confirmed the prediction, the conclusion 
about the effects of the experimental treatment 
has to be modified. The reduction of conflict, 
achieved by means of an implicit argumentum 
ad hominem, decreased the amount of con- 
formity exhibited by the majority of the Ss. 
But for the Ss scoring high on need Achieve- 
ment or social approach, this procedure was 
not effective. Since there is no indication that 
these Ss failed to perceive the difference 
between themselves and the accomplices which 
the experimental manipulation had attempted 
to create, the explanation seems to be that 
the assumed reduction of conflict did not 
actually occur for these individuals. As they 
were highly motivated to achieve, their 
strong motivation to be correct may have 
kept the conflict between their own perceptions 
and the majority judgments alive. Thus, they 
were forced to vacillate between independence 
and conformity, and thereby to conform more 
than the other Ss in this condition, who could 
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make better use of the cues indicating that the 
majority might be in error. 

No entirely satisfactory interpretation of the 
results obtained with the personality measures 
has been found’. It should be mentioned, 
however, that the few related experiments on 
conformity employing contrasting experi- 
mental conditions as well as personality meas- 
ures have shown similar interactions between 
condition and personality (10, 12, 16). 

Further theoretical considerations. It will be 
recalled that the majority of the Ss showed 
the predicted reduction of conforming re- 
sponses after they had obtained information 
enabling them to account for the discrepant 
majority reports. This information had been 
implicit in the events of the practice, which 
employed different stimuli and rather different 
(shorter) exposure times than the test trials, 
In this situation, one cannot speak simply of 
reinforcement of specific responses like the 
individual norm in the autokinetic situation. 
Neither was disagreement with the majority 
directly reinforced since in the practice, the 
accomplices’ responses eventually agreed with 
the \S’s original one. Of course it can be argued 
that an internal response of thinking oneself 
right was reinforced (13), However, since 
generalization of this response along unspeci- 
fied dimensions must be assumed in order to 
cover different stimuli and different social 
situations, the value of this concept seems 
debatable. The interpretation proposed here 
is that the strengthening of the Ss’ independ- 
ence involved cognitive processes, especially 
the Ss’ inference about relative abilities of the 
observers, and its application to the new 
conflict situation in which the Ss found them- 
selves. In other words, cognitive relationships 
among all three elements of the standard 

social situation—what Newcomb (17) has 
called the co-orientation system—were jn- 
volved. Pp 

An exclusive emphasis on cognitive pro- 
cesses, of course, does injustice to the com- 

plexity of the total situation. The development 
of the conflict, and the form that its resolution 
takes, depends on a number of additional 
factors. Yet it seems that, at least to some 
extent, two aspects of the situation are separa- 
ble. To use the language of communication 


9A more extensive treatment of these results can 
be found in (19, 20). 
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theory, it can be said that the receiver, con- 
fronted with different signals in different 
channels, has to make the decision as to which 
of the two is “noise” and which the correct 
message. And in order to make this decision he 
needs additional information. Which message 
he in turn ¢ransmits depends on additional 
considerations, if he realizes that the factually 
correct message may not be “socially correct,” 
and if he expects feedback from the social 
environment. In fact, these effects may easily 
react on the first-stage decision, and in the 
extreme case the S may give up the attempt 
to collect independent information and rely 
entirely on the social channel. 

Yet the thesis of this study was that a 
satisfactory analysis of conformity requires 
more than the definition of a phenotypic class 
of responses coinciding with the modal values 
of the group. Not only, as Asch (2) has argued, 
must there be a genuine conflict between the 
different inputs, which is not present in 
experiments with ambiguous or carefully 
equated stimuli. The prediction of independent 
behavior depends on additional information 
about the source of the conflict, Simply to 
demand of the S (or to train him) to disregard 
the socially transmitted information without 
accounting for it (13) may not be “rational 
either, since so much of our knowledge about 
reality is received through social channels, 
and “social reality” may not be only a delim- 
itable part of our environment but may, in 
principle, permeate it throughout. 

The large interindividual variation in the 
present experiment and its relation to motiva- 
tional variables makes it abundantly clear 
that a cognitive theory alone is not able to 
explain the events. The readiness to draw the 
conclusion that the majority is wrong, as We 
as the willingness to look for information 
indicating this possibility, may be, in a mider 
context, important factors in the determinatio! 
of conformity or independence. 


SUMMARY 


Fifty Ss were given the task of identifying 
nonsense syllables presented tachistoscopically 
above threshold duration, while being com” 
fronted with wrong responses of a majority O5 
secretly instructed Ss. In simulated pan 
trials, half the naive Ss had been given t a 
impression that they had lower recognitio 
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thresholds than the other observers. This 
manipulation was intended to reduce the 
conflict between Ss’ perceptions and the 
majority judgments by implying that the 
majority might be in error. It was predicted 
that the amount of conformity in this condition 
would be less than in the standard condition 
which provided no information enabling the § 
to reconcile the apparent contradiction. 

The prediction was confirmed. Absence of 
the difference between conditions on the 
first two trials indicated that the main result 
was not due simply to the adaptation of the 
Reduced Conflict group to the physical 
features of the task. 

The large variation within conditions was 
found to be related to two personality meas- 
ures: need Achievement and social approach. 
The Ss with high scores on these measures 
performed rather similarly in both conditions, 
while the predicted difference was obtained 
through a great reduction of conforming 
responses by the Ss with medium and low 
Scores, 

It was concluded that the creation of a 
frame of reference permitting the interpreta- 
tion of the majority responses as in error can 
reduce the amount of conforming behavior. 

owever, some individuals may not be willing 
or able to make use of such cues. 
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THE REPRESSION RESPONSE TO IMPLIED FAILURE AS A 
FUNCTION OF THE HYSTERIA-PSYCHASTHENIA INDEX? 


CHARLES B. TRUAX 


University of Wisconsin 


mIs research was designed in order to 
test the Freudian theory of repression 


available for verbalization. 
terms of Hullian learning theory (1), the 
repression hypothesis States that verbal re- 
sponse-producing cues which have been as- 
sociated with anxiety generate anxiety itself, 
and that the act of inhibiting the verbal 
response is learned through the reduction of 
anxiety. Further, the anxiety should generalize 
to other verbal responses that are associated 
with the original anxiety-associated response. 
Repression, then, should be learned to verbal 
responses that are associated with the original 
anxiety-producing response. 

Tf repression is to be regarded as only one 
of the possible learned responses or mechanisms 
used to reduce the anxiety drive, however, 
then not all persons subjected to anxiety may 
be expected to react by repressing the anxiety- 
laden material. Erickson (3, 4) has suggested 
that scores on the Hysteria scale minus scores 
on the Psychasthenia scale of the MMPI 
differentiate subjects (Ss) that characteris- 
tically react to anxiety with the response of 
repression from those who usually respond in 
some other manner. Empirically, he found 
that a high Hy-Pt group recalled fewer in- 
completed tasks and were rated as showing 
more repression than a low Hy-Pt group. 


Specifically, then, it is hypothesized that 


those Ss scoring high on this repression index 


show more repression responses to anxiety 
; A 
than do Ss scoring low on it. l 
In an earlier study by Zeller (15), differential 
retention for two separate groups following 
failure-induced anxiety was demonstrated. 


debted to Dr. J. C. Gilchrist for 


is in 
1 Tie anther $ ms in all phases of the research. 


suggestions and criticis 


This finding, however, can be satisfactorily 
explained in terms of a decremental effect of 
anxiety upon performance in a complex 
task as proposed by Farber (6). 

The test of repression used in the present 
research involved the determination of 
whether, once new learning was established, 
retention for associations involving anxiety- 
laden material was poorer than for associations 
involving neutral material for the same S at the 
same time, and whether, when anxiety was 
yemoved, there was no differential rentention 
of the same materials. 

In a recent study of the effects of anxiety 
and failure, Truax and Martin (13) found an 
interaction between degree of personalization 
of failure and manifest anxiety. To avoid 
this problem in research on failure effects, the 
present study employed an implied-failure 
technique, involving a minimum of personal 
interaction between experimenter (E) and 5. 
This procedure has the additional advantage 
of reducing the importance of Æ in the failure 
experience, thus probably lessening the release 
of S’s anxiety in the form of hostility towar 
E. In the present study, then, E made 0 
comments to indicate S’s failure. Instead, the 
anxiety-producing task, which was irreleva? 
to the experimental task, was simply mace 


so difficult that S could not perform vell. 
Failure was thus implied by the performan 
olving, 


itself, and since the task seemed ego-inV 
anxiety could be legitimately inferred. 


METHOD 


‘ eri- 
There were six parts to the present exp a 
ional, a 


ment: (a) selection of neutral, emot10 is 
traumatic words to be used as stimuli T n 
of 18 paired associates, (b) original learn) 


5 A a : le 
the paired associates, (c) induction © an a 
for the traumatic words, (d) first relear 
emov4 


and recall of all paired associates, (e) T 
of induced anxiety for the traumatic w a 
and (f) second relearning and recall 0 

paired associates. 


rds, 


in 
r ; rds used | 
The emotional and neutral stimulus W0’ rd associa” 


the paired associates were selected from a WO! 
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ion list of 79 items. The traumatic stimuli were taken 

-rom the titles of subtests of an “intelligence test” used 
to imply failure. The response words used in the paired 
associates were three-letter nonsense syllables with an 
association value of 53 per cent taken from Glaze’s list 
(9). Since the response words were neutral nonsense 
syllables, it was anticipated that conscious suppression 
of the response would be minimized and thus permit a 
clearer test of repression. 


Selection of Neutral and Emotional Words for 
Each S 


an Waras from the word association list were presented 
F: erbrand Tachistoscope for a duration of .5 sec- 
onds to each S individually. The S was asked to respond 
immediately with the first word coming to mind. Reac- 
tion times to the nearest .05 sec. and all verbal responses 
to each word were recorded. Depressing & telegraph key 
peel nein A carte both the tachistoscope and a 
“a E ard electric clock; releasing the key stopped the 
lock at the beginning of S’s verbal response. 
walt addition, Galvanic Skin Response (GSR) changes 
rere obtained during the reaction time interval and 
Were recorded by means of a General Electric photo- 
electric continuous recorder, An ME GS apparatus 
designed by Grant (10) with dry silver electrodes aY 
tached to the fingers was used. Since the size of GS. 
changes varies with Gg base level of resistance, the 
Sensitivity of the measure for each S was based upon the 
imal Toot of conductance at his base level to permi 
ra-individual comparisons. The actual sensitivity 
of the GSR measure Scant between 30 and 600 ohms. 
The neutral words for a given S were then defined 


as the six words to which the smallest GSR deflections 
and short reaction times were given, while the cmo- 
tional words for a given 5 were defined as the six words 
to which he gave the largest GSR deflections and short 
reaction times. Long reaction time words were not used 
for two reasons: (a) long reaction times is correlated with 
association or entropy value, and (b) failure to respond 
immediately results in spuriously large GSRs, possibly 
because of anxiety over disobeying instructions to re- 
spond quickly, As a result of this selection procedure, 
each S had individualized lists of neutral and emo- 
tional words, defined by his own responses to them. In 
addition, the neutral and emotional words for each 
group of Ss were equated on the basis of the Thorndike- 
Lorge word count (13). 


Implied Failure and Traumatic W ords 


In the session with each S between original learning 
and the first relearning and recall, failure was induced 
ords. Each S was 


and associated with the traumatic w a 
given a test booklet with the words “Wisconsin Test 
of Intellectual Abilities: College Form B” displayed on 
the upper half of the cover sheet. This test consisted of 
six subtests prominently labeled with the names to be 
used as the traumatic words in the paired associate 
task: opposites, numbers, completion, reasoning, arith- 
metic, and analogies? Each subtest consisted of ten 
items of extreme difficulty. At the beginning of each sub- 
test, Æ read the title and gave a brief but glowing ac- 


2 Available on request from the author. 
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TABLE 1 
EXPERIMENTAL SEQUENCE 


Day Session Intersession Operation Involved 


Interval 

1 0 — Selection of emotional 
and neutral words 

2 I 2-3 wks. Original learning of all 
18 paired associates 

3 0 24 hrs. Anxiety induction by 
implied failure 

4 I 24 hrs. Tirst relearning an 
anxiety removal 

5 II 24 hrs. Final relearning under 
neutral conditions 

count of the abilities measured by the test, stressing 


their relationship to general intelligence. The name of 
the subtest (traumatic word) was repeated three times 
in the instructions for each subtest. The instructions 
emphasized speed, and a stop watch was used con- 
spicuously. After each S completed only half the items, 
E would click the stop watch and say “Stop.” No 
specific comments about the adequacy or inadequacy 


of the performance were made. 


Disabusal of Failure-induced Anxiety 


At the conclusion of the first relearning session, each 
§ was asked to recall the “intelligence test” that he had 
taken the day before. The Æ asked S how he thought he 
had done on the test and how he felt about it afterward, 
After listening to S’s remarks, Æ continued: 
{Actually, that wasn’t an intelligence test at all. We 
are trying to study the elect of failure on verbal 
learning, so we simply made up 4 fake test. As you 
probably noticed, the fake test was so difficult that 
everyone had to do poorly on it. Usually there aren't 
any time limits on this sort of test, but to make you 
feel that you failed we stopped everyone when they 
were halfway thru. By the way, did you notice that 
part of the response words that you had to learn were 
the same as the names of the subtests of the fake 
LQ. test? (E recorded the responses of S) Well, I 
hope the fake test didn’t upset you too much. Tomor- 
row we'll finally get to the GSR conditioning.) 


Learning, Recall, and Relearning 
S learned and relearned by the method of ad- 
justed learning (8) toa criterion of two successive cor- 
rect anticipations for each paired associate. The stimu- 
lus and the response words were printed on different 
sides of 3 x 5 cards with the nonsense syllables paired 
with the stimulus words so that a given nonsense syl- 
Jable was paired with each of the three classes of stimuli 
equally often. With this method of counterbalancing, 
any difficulty due to the nonsense syllables was con- 
trolled. 

The 18 cards containing the paired associates were 
shufiled between each trial to eliminate serial position 
effects. The stimulus and then the response words were 


presented to S separately for a two-sec. period. A one- 


sec. interval between pairs and a five-sec. interval be- 
tions was used. After two 


tween successive list presenta 


Each 
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successive correct anticipations for a given pair, that 
pair was discarded, and learning or relearning continued 
with the remaining pairs. 


Experimental Sequence 


The procedural sequence is presented in Table 1. It 
will be remembered that S was led to expect GSR condi- 
tioning on the days following the original learning of 
the paired associates. By this technique, it was hoped 
that any inter-session rehearsals of the material would 
be minimized. 


Subjects 


Individuals from the upper and lower five per cent 
on the Hy-Pt distribution were selected from 650 stu- 
dents in the beginning psychology course at the Uni- 
versity of Wisconsin. These Ss were then contacted and 
asked to serve in a “problem solving” experiment. The 
first 20 Ss from each end of the Hy-Pt distribution that 
were contacted volunteered and were used in the present 
experiment. Thus, a total of 40 Ss served in this study, 
with seven women and thirteen men in the high Hy-Pt 
group and seven women and thirteen men in the low 
Hy-Pt group. Identical procedures were used for both 


groups. 
RESULTS 


The performance scores used as the depend- 
ent variable consisted of difference scores 
obtained by subtracting the recall or the 
learning-relearning scores on the six neutral 
paired associates from the scores for either 
the six emotional or the six traumatic paired 
associates. Four sets of difference scores 
resulted: (a) traumatic recall difference 
scores, (b) emotional recall difference scores, 
(c) traumatic learning-relearning difference 
scores, and (d) emotional learning-relearning 
difference scores. 

Four separate analyses of variance for 
double classification with repeated measures 
were performed on the data, corresponding to 
the four types of difference scores used as the 
dependent variable (5). In each case, high and 
low groups based on the upper and lower 
five per cent of the Hy-Pt index were used as 
the classes, and measures across the three 
experimental sessions were used as the repeated 
measures. . 

The analysis of variance for the recall 
difference scores and for the emotional learn- 
ing-relearning difference scores revealed no 
significant differences. In the case of the 
analysis for the recall difference scores, the 
within Ss term also proved to be nonsignificant. 

This lack of individual variation appears to 
be due to the constricted range of scores 
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TABLE 2 


ANALYSIS OF VARIANCE FOR RELEARNING 
or TRauMATIC WORDS 


Source df Mean F 
Square 
High & low groups Hy-Pi 1 130.34 = 
index 
Between Ss within groups 38 321.07 3.70*** 
Sessions I, II, III 2 631,33: 7.52*** 
High & low groups X 2 1,601.74 18.47*** 
Sessions 
Sessions X Ss within 76 86.72 
groups 
Total 119 


*** Significant beyond the .001 level of confidence. 


caused by relearning. The meaningfulness of 
these null results with recall scores is, therefore, 
hard to assess. 

The results of the analysis of the learning 
and relearning of the traumatic words in 
comparison to the neutral words is shown in 
Table 2. The main effect of learning-relearning 
sessions is significant beyond the .001 level of 
confidence, indicating that ‘retention of 
traumatic words for all Ss changed in relation 
to the neutral words as a function of the 
operations of anxiety induction by implied 
failure and the removal of this anxiety by 
instructions. That is, the traumatic words, 
which were harder to learn in Session I, became 
easier to learn by Session III. Further, the 
interaction of high and low Hy-Pt groups by 
measures across learning-relearning sessions 
was significant beyond the .001 level of con- 
fidence, indicating that the high and 
Hy-P1 groups responded differently to a? 
induction and anxiety removal. , 

To evaluate further the interaction of high 
and low Hy-Pt groups by measures an 
experimental sessions, the Duncan Multip © 
Range Test was performed on the data 
Since extreme groups were used in the presen 
study, the .01 level was taken as the aeni 
able level of significance. The results of A 
test indicated that the high and low groups C5 
not differ on original learning or ON the fin 
relearning of the traumatic words as compare 
to the neutral words, but did differ 1” t a 
relearning session after anxiety ha pa 
induced (Session II). In Session I, whe 
repression should be manifest, the high gom 
took more trials to relearn the tammi 
than the neutral material as compared to eit 
original learning or final relearning- 


xiety 
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O——O HIGH Hy-Pt GROUP by anxiety. That these findings were not the 

15 | &----4 Low Hy-Pt GROUP result of supression, rather than repression, 
is indicated by the answers to the question: 
“Did you notice that part of the response 
words that you had to learn were the same as 
the names of the subtests of the fake IQ test?” 
All Ss were quite surprised when this rela- 
tionship was pointed out to them. If one 
grants that there was no reason for the S 
to lie at this point—or even that Ss are not 
good actors—then the existence of the class 
of traumatic words was not known by the 
Ss. It is felt that the 24-hr. interval before 
and after anxiety induction was crucial to 
the present study. If the failure had occured 
in temporal juxtaposition to the paired- 
associate task, it seems likely that many 
Ss would have “seen through” the experiment. 
Unfortunately, the results, although showing 
significant differences in repression, are not 
free from confounding of the method used 


MEAN DIFFERENCE SCORES FOR TRAUMATIC 
MINUS NEUTRAL WORDS 


SESSIONI SESSION SESSION M 

Fic. 1. MEAN DIFFERENCE SCORES FOR À i k 
TRAUMATIC WORDS with the act of repression. That is, the amount 

of retroactive inhibition produced by the 


group, on the other hand, took fewer trials to method of adjusted learning is not known. 
Since the traumatic words tended to be more 


relearn the traumatic than the neutral material š ma S 
difficult in original learning, they were not 


in Session II as compared to original learning. n 
The difference between Session II and Session only presented on more trials, but they were 
II for the low group, however, fell short of the also learned last. This, then, allowed for more 
required significance level ($ = .06). retroactive inhibition of the neutral words 
Tn Fig. 1, the mean difference scores for the which would tend to make for better relative 
high and low Hy-Pi groups for the traumatic retention of the traumatic words in Session TE: 
Words are plotted as a function of the sessions. Thus, since repression was not operative in the 
Each point represents the trials to the criterion low group to counteract the effect of retro- 
taken by 20 Ss on the six traumatic minus active inhibition, these Ss showed faster 
the six neutral words. It will be remembered elearning of the traumatic words in Session 
that Session I was a neutral condition of I as compared to the neutral words. In 
Original learning, Session TI the relearning Session Ii, ee cae inhibition 
after anxiety induction by the technique of would act on the — wor cee they 
implied failure, and Session III the final were relearned first in Session LL. If this 
Telearning after the removal of the anxiety by situation actually did occur, there is no need 
Instructions to postulate an antithetical process such as 
j vigilance to account for the performance of 
DīıscussIoN the es group. wane the Fr obtained 
5 , with the traumatic words are due to a proces: 

The Learning-Relearmng M eine of Kapn es of vigilance in the low group and of pee 
tak aipe i in the high group or, instead, to a combination 
of ik as the source of et with the of repression and the effects of retroactive 
Re prenen pe anne proper an d the inhibition inherent in the method of adjusted 
h ea ae fs Hy- Pr index differentiates eaog can only be tete by further oe 
Tepressors from non-repressors. As shown in When emotional words are Ga as a 
Fig. 1, the high group tended to show repres- source of anxiety, the lack of significant dii- 
ferences between high and low groups 1n 


sive responses to anxiety, while the low group i L ps z 
tended to be facilitated in their performance the retention of anxiety-laden material is 


sion 


When the operation of 


———————— 


192 


not consistent with past research where dif- 
ferences have been found in relearning scores 
(14). Such positive results may have been less 
a function of the effects of anxiety associated 
with emotional words than of the use of long 
reaction times as an indicator of the words’ 
emotionality. Emotionality defined in terms 
of long reaction times tends to be related to 
association value, which is also measured by 
reaction time, or to entropy value. Thus, 
long reaction times indicate low association 
values and short reaction times high associa- 
tion value for the individual S. Studies report- 
ing differences in retention of emotional and 
neutral words and using reaction time as a 
measure may have confounded the effects of 
anxiety and association value. Since no 
difference in the retention of emotional 
words was found in the present study where 
emotional and neutral words were equated in 
association value by means of (a) reaction 
time for each S and (b) the Thorndike-Lorge 
word count, this methodological point merits 
further attention. 


The Recall Measure of Repression 


The design of the present experiment did 
not seem adequate to allow for a clear test of 
retention of anxiety-associated words in 
terms of recall scores. A range of only seven 
scores was possible, and the effect of relearning 
reduced the observed individual differences 
to nonsignificance. This aspect of the hypothe- 
sis, then, must be regarded as essentially un- 


tested. 


Implications of the Effect of Implied Failure 

The results indicate that anxiety does 
develop in a situation where S perceives 
himself as not doing well, even though no 
knowledge of the adequacy of his performance 
is given to him by £. This suggests that, in 
studies of learning designed to evaluate dif- 
ferences in the difficulty or complexity of a 
task, there exists the very real danger of 
confounding the effects of task complexity 
with the effects of failure-induced anxiety. 
Tt is common for Ss to discuss an experiment 
with others who have already served in it 
before they volunteer to participate. If the 
experienced S has served in a simpler task and 
the naive S is to serve in a more complex one, 
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then the experimental situation may imply 
failure to the naive S on the basis of his false 
performance norms. The resulting confound- 
ing of task complexity with the effects of 
failure-induced anxiety should be particularly 
prevalent in such studies as those of concept- 
formation, where the nature of the task re- 
mains unchanged but the problem increases 
in complexity. Thus, for example, the presence 
of irrelevant information might result in 
greater time consumption while the occurrence 
of failure-induced anxiety might result in 
an inability to complete the task successfully. 

To control for the differential presence of 
implied failure in different levels of task 
complexity, it is suggested that (a) generous 
performance norms be given to each S, (b) 
each S be told that different Ss are given tasks 
differing in difficulty, and (c) question- 
naires be given to S at the conclusion of the 
experiment to evaluate the differential presence 
of implied failure in groups where task com- 
plexity differs. 


The Relation Between the Present Results and 
Manifest Anxiely 


Previous findings, like those reported by 
Zeller (15), have been susceptible to inter- 
pretation in terms of simply the deteriorating 
effects of failure-induced anxiety. No repres- 
sion hypothesis seems necessary. In the presen” 
study, however, where each S acted as a 
own control and the Hy-Pi groups differe 
significantly in their response to anxiety, i 
repression process appears to be clearly 
demonstrated. 

The present results cannot be han 
drive-anxiety interpretation alone. Since 
on the Manifest Anxiety Scale were av 
for the Ss used in the present research, the ty 
could be divided into high and low anien 
groups. The high Hy-Pt group is a low anxi? H 
group (all scores less than 13), while the Hs 
Hy-Pt group is a high anxiety group “a 
scores greater than 23), Previous results j 
the interaction of individual differences ø 
MAS scores and failure anxiety are consiste, 
with the notion that anxiety acts as @ ae 
(6, 13). Considering only the drive aspet jq 
anxiety in the present study, however, ui y 
lead to the prediction that the high an* 
group would perform more poorly 


dled by * 
scores 
ailable 
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low anxiety group on Session II (where anxiety 
is present). The results are in exactly the 
opposite direction. 

While the present results are not at all a 
contradiction of the drive-anxiety hypothesis, 
the results do demand that more attention 
be Placed on habits associated with anxiety. 
It is suggested that the Hysteria variable? is 
a measure of the specific habit of repression 
as a response to anxiety. Viewed from this 
Perspective, the results are consistent with 
the clinical notion of repression and with 
current Hullian learning interpretations of 
the interaction of drive and habit. 


SUMMARY 


The present research was carried out in 
order to study the effects of anxiety induced by 
ee eae in relation to the repression 
i Ypothesis and the personality factor involved 
n the Hysteria scale minus the Psychasthenia 
Scale of the MMPI. 

A total of 40 Ss served in the experiment: 
20 Ss from the upper five per cent of the Hy-Pt 
distribution and 20 Ss from the lower five per 
cent. There were six parts to the experimental 
Procedure: (a) selection of six neutral, six 
emotional, and six traumatic words to act 
as stimulus words, and 18 medium association 
Value nonsense syllables to act as response 
tors for paired-associate learning, (6) original 

arning of the 18 paired associates to a 
Criterion of two successive correct anticipa- 
tions per pair, (c) induction and association 
of anxiety with the six traumatic words, 
as recall and relearning of the 18 paired 

Sociates, (e) removal of the anxiety associated 


With the traumatic words, and (f) final recall 


and relearning of the 18 paired associates 
ted with the 


ten removal of anxiety associa 
ane words. 
he dependent variables 


in a cluster analysis of 
thogonal factors; 
of symptoms. In 
d low groups used 
at the admission 
re for the 
for the 


used consisted 


TF 

i Little and Fisher (11), 
deni Hysteria Scale found two or 
aioe | of symptoms and admission 
i hoe the subscales for the high an 
sub e present study, it was found that 
ae contributed 53% of the Hysteria sco 
rae but only 6% of the Hysteria score 

€ssors, 


of discrepancy scores obtained by subtracting 
the trials to reach criterion for the neutral 
words from trials to criterion for either the 
emotional or traumatic words. Thus, each S 


served as his own control. 

The results were interpreted as consistent 
with the repression hypothesis and with the 
hypothesis that the Hy-P! index is a measure 
of repressors. The results were also discussed 
in relation to the drive-anxiety hypothesis 
and in relation to the implied-failure technique. 
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INTRA-INDIVIDUAL CONSISTENCY IN CONCEPTUALIZATION 


LOUIS J. MORAN 
University of Texas 


LAURENCE S. McGAUGHRAN! anb DON LEVENTHAL? 
University of Houston 


His study is concerned with the general 

problem of whether individuals tend to 

exhibit a characteristic approach that is 
consistent and predictable on a variety of 
conceptual tasks. Both normal and psychiatric 
groups are used, not for purposes of intergroup 
comparison, but to examine intra-individual 
consistency in the conceptual behavior of 
subjects, whether normal, schizophrenic, or 
brain-damaged. 

For the present purpose, a concept may be 
defined as a principle under which a variety of 
particulars may be grouped. These principles 
may be characterized in a number of ways. 
For example, a concept may vary in its degree 
of ‘‘publicness,” i.e., the degree to which the 
principle covering a grouping is shared and 
freely communicated by the majority of 
persons within the same subculture. Thus, a 
subject who collects several three-sided figures 
under the concept triangles is using a more 
“public” principle than a subject who collects 
the same figures under the relatively private 
concept, pleasing. In other words, a concept 
may vary in the extent to which it permits 
public prediction of the objects or events which 
it is intended to categorize. 

Concepts may be classified also in terms of 
the range or order of objects or events which 
may be grouped under them. Thus, a subject 
who characterizes a group of cattle as organ- 
isms is using a higher order of classification 
than a subject who uses the concept animals; 
the latter, in turn, is using a higher order of 
classification than a subject who specifies 
cows. In this instance, the concept organism 
may be called highly “open” in relation to 
the concept cows in terms of the range of other 
objects that potentially might be included 
under the principle used. 

Concepts may, of course, be classified in a 
number of other ways. The present analysis, 


1 Employed part-time at the Houston V. A. Hospital 
where part of this study was conducted. 
* Now on the staff of the Topeka State Hospital. 


however, involves only the two variables 
mentioned, “‘publicness-privateness” and 
“openness-closedness.” This schema delineates 
four “conceptual areas” which are represented 
by the quadrants formed through the inter- 
section of the two variables. ‘Thus, concepts 
may be classified as public-open, public-closed, 
private-open, or private-closed. It has been 
shown that by means of this schema, such 
groups as the brain-damaged, the schizo- 
phrenic, and the intellectually bright and dull 
may be reliably differentiated (1, 2, 3). 

The present study is designed to test the 
hypothesis that individuals evidence a con- 
sistent tendency to operate within a specific 
quadrant in responding to a variety of con- 
ceptual tasks. Such a consistent tendency 
constitutes a conceptual trait, implying @ 
general and predictable behavior potential 
toward the solution of conceptual problems. 


METHOD 


Subjects 


The initial sample consisted of 77 subjects (Ss). OF 
these, 40 were patients in the Houston Veterans Ad- 
ministration hospital, undergoing treatment for Or T 
cuperating from nonpsychiatric illnesses which wou 
not affect in any known way their performance on con: 
ceptual tasks. A second group of 37 Ss were backwar 
psychiatric patients in the Waco Veterans Administra- 
tion Hospital, all diagnosed as chronic paranoid schizo- 
phrenics. 


Procedure 


In a study conducted several years ago by the senior 
author (4), a battery of seven word-meaning tests W25 
administered to these Ss in order to test several by- 
potheses concerning the schizophrenic’s understanding 
and use of word meanings. Their performance on two 
of these tests, a 25-item synonym test and a 17-item 
similarities test, is reanalyzed here. The new scoring 
procedures, based upon the conceptual-area schem® 
were derived a priori. Each item was scored on every 
test before proceeding to the next item. The consistency 
with which each S$ used a given conceptual area on 
these two tests and on an object sorting test, which was 
administered at the same time, was tested. Š 

Sorting test. The sorting task was administered 12- 
dividually in accordance with the directions given 
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Rapaport (5). In brief, the technique involved the 
sorting of 33 objects familiar in everyday experience. 
One part consisted of seven tasks requiring S to group 
with Some particular object the others that “belong 
with it.” The second part consisted of twelve tasks re- 
quiring S to identify a basis for the grouping of a num- 
ber of objects arranged by the experimenter. 

Each of the responses to the sorting test items was 
assigned to one of the four conceptual areas: public- 
open, public-closed, private-open, _ private-closed. 
Publicness-privateness was scored in terms of the extent 
to which an individual concept permitted potential 
public prediction of limits. For example, the boundaries 
for a concept of “redness” are rather clearly denotable; 
the conceptual boundaries for “attractiveness” would 
be considerably less so. The variable of “openness- 
closedness”’ was scored in terms of the number of com- 
mon attributes which were used or implied in each con- 
ceptual definition, For example, the classification of a 
group of objects as “all are red” is “open” relative to 
such a classification as “all are round and red and rub- 
ber.” This latter concept binds together more of the 
perceptible attributes of the objects and, hence, limits 
the number of other objects that might potentially be 
included in the same domain. An extended treatment 
of this scoring system is provided by McGaughran and 
Moran (2). Interjudge agreement, based upon 55 ran- 
domly selected items, was 93 per cent. 

Similarities test. The similarities test was composed 
of 17 groups of two to four words which may be cate- 
gorized under a single concept; for example, boat, train, 
car: “all modes of transportation.” The instructions 
were as follows: “Now I am going to read you some 
groups of words. I would like you to tell me what the 
words in the group have in common; that is, in what 
Way they are alike.” 

A response was scored as “public 
Words into an appropriate category such as doo! 

openings”; or described a meaningful similarity among 
the words like “both part of a house.” A response was 
scored as “open” if § responded with a concept that 
Was of a higher order of classification than any of the 
individual stimulus words; for example, door-window, 
‘openings.” Interjudge agreement, based upon 200 
randomly selected items, was 86 percent. —— | 

Synonyms test. The synonyms task was given in two 
Parts. The first part consisted of 25 very familar words 
Which were read one at a time to the $ under the fol- 
lowing instructions: ‘Tell me all the words you can 
think of that have the same meaning as this word. For 
example, words having the same meaning as little are: 
Small, tiny, diminutive, and so on. Now, how many 
Words can you think of that have the same meaning as 
the word house, etc.?” 

Performance on this part of the test p! 
Somewhere on the public-private variable. 
to each of the 25 words were scored as follows: 

(a) Score 2—gave more correct synonyms thai 

correct synonyms. 

(b) Score 1—gave equal number of correct synonyms 

and incorrect synonyms. 

(c) Score 0—gave more incorrec 

correct synonyms. : 

Categories a, b, and c represent levels of communi- 
cability in S’s responses. The scoring is relatively in- 


” if S grouped the 
as door-window, 


laced the S 
Responses 


n in- 


t synonyms than 
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dependent of both S’s brightness and the number of 
responses given. Thus, an S may give one correct syno- 
nym for each stimulus and still achieve the highest pos- 
sible score on the public-private variable. 

The second part of the synonym task places the S 
somewhere on the open-closed variable. The same 25 
words were presented. After each word, however, eight 
choice words were listed. The number of correct syno- 
nyms varied from two to five of the eight alternates. 
The incorrect alternates were associated in some way 
with the test word but were not synonyms for it, e.g., 
house-roof. Subjects were given the following instruc- 
tions: “After each capitalized word in this list are a 
number of words in small print. Underline each of the 
words in small print that has the same meaning as the 
word in capital letters. Do not guess.” 

In effect, for each word S was given eight concepts 
to “sort” and to place those together that belonged 
with the stimulus concept on the basis of synonymity. 
If he grouped most or all of the correct synonyms, he 
utilized to the maximum the concept of synonymity and 
was relatively “open.” If he grouped a few of the cor- 
rect synonyms, he was restricting the concept of syno- 
nymity and was relatively closed iż his sorting. To the 
extent that nonsynonyms were a ‘ouped, his con- 


cept of synonymity was overinclusive an Pega a 


relatively open. A measure of open-closed, then, wai 
given by simply totalling the number of concepts un- 


derlined. 1 
In short, each S performed on three different concep- 


tual problems. The first task involved the formation by 
S of conceptual groupings with concrete objects. The 
second task required the formation of a concept to serve 
as the principle for grouping words. The third task 
served as a more intensive appraisal of S’s use of a single 
principle, that of synonymity, The degree of consis- 
tency in approach to these three tasks was estimated by 
Kendall’s coefficient of concordance (W.). 


RESULTS 


Individuals in both groups were reasonably 
consistent in the nature of their solution on 
the three different conceptual tasks. On the 
public-private variable the normal group 
achieved a Wù of .47 (p < .05) and the schizo- 
phrenic group a We of .67 (p < .01). On the 
open-closed variable the W.’s were .57 
(p < .01) for the normal group and .63 
(p < 01) for the schizophrenic group. 

The public-private and open-closed variables 
appear to vary independently. Rank-order 
correlations between these two variables did 
not approach significance with either the 
normal group or the schizophrenic group. 


FoLLow-up STUDY 


In an independent study of differences in 
conceptualization between schizophrenic Ss 
and brain damaged Ss, the same measures 
were administered to 30 chronic paranoid 
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schizophrenic patients and to 30 patients with 
a diagnosis of generalized brain damage at the 
Topeka State Hospital. At the time the 
protocols were scored, the scorer was not 
aware of the hypothesis of intra-individual 
consistency. Coefficients of concordance were 
determined after analyses of intergroup 
differences were completed. 

On the public-private variable, the schizo- 
phrenic group achieved a W, of .78 (p < .01) 
and the brain damaged group a We of .67 
(p < .01). On the open-closed variable, the 
Wes for the schizophrenic group and the 
brain damaged group were .48 (p < .05) and 
-63 (p < .01), respectively. In both groups, the 
public-private and the open-closed variables 
were uncorrelated with each other. 


DISCUSSION 


In previous studies using the conceptual-area 
schema, inter-individual comparisons have 
shown wide differences in approach to con- 
ceptual problems. The measurement of 
individual differences has proved serviceable 
particularly in the discrimination of deviant 
groups, such as the psychotic (2) and the 
brain damaged (3). It was the intent of the 
present study to test for intra-individual 
consistency on a variety of conceptual tasks, 
The results of this investigation seem to 
warrant the conclusion that intra-individual 
consistency in conceptual-area performance 
is sufficient to merit the term “conceptual 
trait.” 


SUMMARY 


An object sorting task, a similarities task, 
and a synonyms task were administered 


individually to 40 nonpsychiatric medical 
patients and to 37 chronic paranoid schizo- 
phrenic patients. Performance on each task was 
scored in terms of a “conceptual area” schema, 
which placed the subject’s conceptual solu- 
tions in one of four “areas”: public-open, 
public-closed, private-open, private-closed. Re- 
sults were evaluated by Kendall’s coefficient 
of concordance. Both nonpsychiatric and 
Psychiatric subjects were found to be consis- 
tent in their approach to this variety of 
conceptual tasks, These results were extended 
to new samples of paranoid schizophrenic and 
of brain damaged subjects. It was concluded 
that the conceptual-area schema provides a 


means of identifying and measuring conceptual 
traits. 
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DISTINGUISHING CHARA 
CTERISTICS OF COLLABORAT' 
RESISTERS AMONG AMERICAN PRISONERS OF WAR es 


EDGAR H. SCHEIN, WINFRED F. HILL, HAROLD L. WILLIAMS, anp ARDIE LUBIN! 
Walter Reed Army Institute of Research 


MERICAN soldiers captured during the 
Korean conflict collaborated with the 
iced ag a yl degrees. Most 
Ana Re. aie s) committed trivial 
smaller i Sate peace petitions, but a 
“iG umber of men engaged in persistent 
oad arcs which included writing, signing, 
vaa oe signatures for peace petitions, 
other nh anti-United Nations lectures to 
distributi Vs, aiding in the preparation and 
Soren’ ion of communist propaganda, en- 
PA a fellow POWs to collaborate, 1n- 
the ee on other POWs, and generally aiding 
(10) stl in their indoctrination program 
- dt was not clear from the accounts of the 
mbes themselves whether one should seek 
ee of such collaboration in the pattern 
th resses which the men faced, or in person- 
al ana background factors in the men 
Banas ves, or in some combination of these 
‘ mstances. Many hypotheses have been 
ee varying in the degree to which they 
tribute collaboration to the so-called com- 
i “brainwashing” techniques, or blame 
on “weaknesses” in the men themselves, Or 
on weaknesses in American society (3, 5, 6 T 
8, 10). 

The present study is an a 
mine whether factors in the background and 
ea cme of the repatriates are related to 
ia conga However, it should be pointed 
soles the search for determinants 1n the 
Bat ground and personalities of the men does 
D represent a lack of emphasis in our think- 
( & on the role of the stressful environment 
ke described in references 4, 8, 11). Rather, 
ie felt that whatever personal determinants 

erge will illuminate the complex relationship 
a specific situational stresses and the 

dividual upon whom they impinge. 


ttempt to deter- 


METHOD 


Wasi 
ariables and Instruments 


T was decided to investigate backgrou 
ectual variables, and personality varia 


nd factors, 
bles. How- 


Capt. Schein, Capt. 


1 s 
At the time of the study, d 
ter Ree 


Williams, and Sp. 3 Hill were at the Wal 


ever, we could not expect to test any man for more than 
two hours, which resulted in a somewhat arbitrary set 
of decisions concerning tests and prevented an extensive 
sampling of personality dimensions. 

In order to determine whether such factors as home 
background and social status related to collaboration, 
the following biographical information was obtained 
from each man: (a) military rank, (6) whether the 
subject was in the regular army, reserves, National 
Guard, or had been drafted, (c) location of pre-service 
home community, (d) age, (e) years of education, (f) 
civilian occupation, (g) length of army service prior 
to capture, (#) length of internment, (é) religion, (j) 
whether the subject had spent most of his life in a 
large city, small city, small town, or farm, and (k) 
whether he had spent most of his life with both parents, 
one parent, or neither parent. 

In order to determine whether level of intelligence 


related to collaboration, the information and compre- 


hension subtests from the Wechsler-Bellevue Adult 


Intelligence Scale an 
abstraction) of the Shiple 
education may also be inclu 


variables) were administered. 
The selection of suitable personality dimensions 


presented a more difficult problem. Because collabora- 
tion is probably a very complex multidetermined 
behavior pattern, specific hypotheses were not formu- 
lated. However, it seemed likely that a man’s relation- 
ship to authority figures should play a large role in 
determining his response in the prison situation. On 
the one hand, it might be expected that opportunistic 
disregard for any form of authority would produce a 
pattern of collaboration. On the other hand, a rigid 
dependency on any form of authority might also lead 
to collaboration if the POW accepted his Chinese 
captor as the primary source of authority. These two 
possibilities led us to administer the Psychopathic 
Deviate Scale of the Minnesota Multiphasic Personal- 


ity Inventory (MMPI) and an authoritarian scale 
resembling the California F scale (1). The F scale 
could not be used because of the lower educational 


level of the subjects, and because a number of the F 
scale items involve content which we considered to be 
too emotion laden for the POWs (our items put pri- 
mary emphasis on intolerance of ambiguity). Our scale 
and the F scale were subsequently administered to a 
sample of 74 army enlisted men (not former POWs) 
and were found to correlate +.81 with each other.? 


d the two halves (vocabulary and 
y-Hartford test (years of 
ded with these intellectual 


chein is now at the 


Army Institute of Research. Dr. S 
Hill 


Massachusetts Institute of Technology and Dr. 
is at Harvard University. 
2To save printing costs, 
has been deposited with the ‘American Documentation 
Institute. Order Document No. 5298 from ADI Aux- 
jliary Publications Project, Photoduplication Service, 
Library of Congress, Washington 25, D. C., remitting 


this authoritarian scale 
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TABLE 1 
PER CENT OF REPATRIATES IN THE THREE CRITERION 
GRroUPs 
Test Sample Total Population 
Group 
N Percent N Percent 
Resisters 39 Sek 138 4.9 
Neutrals 608 80.1 2235 78.6 
Collaborators 112 14 470 16.5 
Total 759 100.0 2843 100.0 


As an additional general measure the short form of 
the MMPI was administered, based on a factor anal- 
ysis of the full MMPI by Welsh (2, 12). This scale 
permitted us to obtain a general index of anxiety or 
neuroticism (Welsh’s A scale) and a general index of 
repression, social inhibition, and denial (Welsh’s R 
scale). The biographical questions, intelligence test 
items, and personality scale items were mimeographed 
for group administration. 


Subjects 


The instruments just described were administered 
to a sample of 759 repatriated army POWs (22 officers 
and 737 enlisted men), or about one-fourth of the total 
2843 army POWs repatriated. These men were tested 
in groups of 10 to 15 on board three ships bound from 
Korea to the United States. As far as possible, tech- 
niques of systematic randomization were used in select- 
ing the ships and selecting the sample of men on each 
ship. Subjects (Ss) were told that the testers were 
interested only in general information about the ordeal 
of imprisonment and the kind of people who were able 
to survive it, not in the responses of particular indi- 
viduals. Testing conditions were not ideal because of 
crowded quarters, occasional seasickness, interruptions, 
and a general state of guardedness in the Ss. However, 
in the main, motivation was considered adequate by 
the psychologists administering the tests on each of 


the ships. 


Criterion of Collaboration 


The criterion information regarding collaboration 
was obtained from the army. Only those men were 
listed as collaborators against whom the army had 
reliable deleterious information and whose cases war- 
ranted some form of disciplinary action. It is quite 
possible, therefore, that some of the men classified 
here as noncollaborators or “neutrals” actually had 
collaborated to some extent. It should be noted that 
this classification is based on overt collaborative be- 
havior and does not necessarily imply any ideological 


nversion. . 
oa few of the noncollaborators were classified as 


i ir pri behavior. 
“resisters” on the basis of their prison camp ior. 
This grea? included those men whose army dossiers 
indicated that they were considered by their Chinese 


i .25 for 6 by 8 in. 

5 mm. microfilm or $1 ó by 

a ia Make checks payable to Chief, Photo- 
Saplication Service, Library of Congress. 


captors as “reactionaries” and were segregated from 
other POWs, and those men who were recommended 
by the army for decorations for their behavior in prison 
camp. 

Thus, the final criterion consisted of three categories: 
(a) the resisters, (b) the neutrals (those who neither 
resisted nor collaborated), and (c) the collaborators. 
The distribution of these three criterion groups is 
shown in Table 1. As can be seen, this distribution 
closely approximates the distribution of these three 
groups in the total population of released army POWs. 
No significant biases appear to have been introduced 
by the sampling procedures. 


Analysis 


An analysis of variance was performed on each of 
the quantitative variables. From each of these there 
was obtained both an eta coefficient, indicating the 
degree of relationship between the given variable and 
the criterion, and an F ratio, from which the significance 
level was obtained. Whenever this was significant, ¢ 
tests were performed to determine whether any of the 
criterion groups differed from each other. Because of 
the unequal numbers of Ss in these groups the ż test 
procedure was used, in spite of the fact that the prob- 
ability levels can not be interpreted exactly. However, 
the fact that we have only three groups to compare 
reduces the likelihood of a marked bias by the use of 
this technique. 

Chi squares were computed for all the variables 
which were in categorical form (nonquantitative). 


RESULTS 


The results on all the quantitative variables 
are shown in Table 2. The most striking 
feature of these results was that the resisters 
and collaborators always deviated from the 
neutrals in the same direction. Moreover, 10 
every case in which the relationship was 
significant, both of these extreme groups Were 
higher on the variable than the neutrals. f 

It will be observed that the biographical 
variables are the most discriminating: 2 
three are significant at the .001 level. Intel- 
lectual variables are fairly discriminating, an 
all but education are significant. Only one 
personality variable is significant, the 
scale of the MMPI, but its relationship to the 
criterion is higher than that of any other 
variable except length of internment. Scores 
on the authoritarian scale fail completely tO 
discriminate the groups. 

Although many of these variables are highly 
significant (frequently far beyond the cutoff 
for the .001 level), the eta coefficients are 
uniformly small. The most powerful predictor 
actually accounts for less than four per cent 
of the variance of the criterion. Hence, for 
practical purposes this is essentially a catalogue 
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of negative results: a number of variables 
discriminate significantly, but none of them 
discriminates importantly. 

Moreover, between any two criterion groups, 
even the statistical significance is not striking. 
Of the eight variables that show overall 
Significance, only five significantly differentiate 
resisters from neutrals, and six differentiate 
collaborators from neutrals. The significance 
levels of these comparisons are shown in the 
last two columns of Table 2. No comparisons 
between resisters and neutrals are shown, as 
none of these was significant. 

Among the categorical variables, all of which 
were of the biographical type, only one reached 
even the .05 level of significance when tested 
With chi square. This was type of enlistment, 
in which the same pattern appeared as in the 
quantitative variables, there being a higher 
percentage of both resisters and collaborators 
among the regular army men than among the 
draftees (officers, reservists and, National 
Guardsmen were not included in this analysis). 
Neither army rank, civilian occupation, 
religion, number of parents present in the 
home, rural vs. urban background, nor location 
of home discriminated the criterion signifi- 
cantly. Even the one significant qualitative 


TABLE 2 
RELATION OF QUANTITATIVE VARIABLES TO CRITERION 
Criterion Group Significance Level 
ii D ——— 
— Eta |" | Rvs| Cys- 
Rt} N*/ C* ratio] N | N 
Length of intern- | X |31.0126.6131.9| -19 | -001 | -001 | -001 
ment (months) |s | 5.9/11-0) 5.9 
Length of service | X |s0.4|30.3/35.5| -15 | -001 | -05 NS 
(months) s |57.8|28.1|37.0] 
Age (years) F |27 olg.1]25.2] -16 | -001 | -05 | -05 
s | 9.1] 3.9] 4.8 
Education (years) | X | 9.5] 9.2) 9-5] -06 | NS | NS | NS 
s | 2.5} 2.0] 2-1 
Information score | X |t4.4]12.3]13.3) 15 | 001 } 001 | -02 
s | 3.5] 3.7] 3.9 
Comprehension | X [10.4] 9.2] 9-7) -09 | -05 | NS | NS 
score’ s | 3.9] 3.2) 2.8 
Vocabulary score | X |22.8|20.523.1| -13 | -001 NS | .001 
s | 8.0] 7.3| 7-6 
Abstractions score | X |20-5/19.8}23.1] -11 | -0t NS | .0% 
s {10.3} 9.7/10.2 
MMPI Pascale | X |18.2|16.7\18.9] -19 | -001 | .05 | -001 
s | 4.3] 3.9] 5-4 
MMPI A scate | X {11.2] 9.8/11.0] .08 | NS | NS | NS 
s | 7.4] 6.5] 6.7 
MMPI R scale & l16.3116.9116.8| -03 | NS | NS | NS 
s | 5.1] 4.7] 4.7 
Authoritarian scale! X |25.1|25.5)25-1| -03 | NS | NS | NS 
s | 4.1) 5.1] 4.9] 


* R stands for resisters, N for neutrals, and C for collaborators. 


TABLE 3 


RELATION OF QUANTITATIVE VARIABLES TO CRITERION 
AMONG MEN INTERNED 31 MONTAS AND OVER 


Criterion Group Significance Level 
Variable Eta 
F |Rysj Cvs. 
RIN] C ratio 4 
19 | .001 | .05 | NS 


Length of service 
(months) 


Age (years) 22 | .001 | .001 | .05 


Education (years) 15 | .01 | NS | .01 


Information score 21 | .001 į .001 | .01 


Comprehension 15 | .01 | .05 | .05 
score 


Vocabulary score 20 | .001 | .02 | .001 


Abstractions score 15 | .o1 | NS | .O1 


MMPI Pd scale 18 | .001 | .05 | -01 


MMPI A scale t1.1/10.2/11.1) .06 | NS | NS | NS 
7.9] 6.5) 7.1 

MMPI R scale 16.4|17.016.6, .0f | NS | NS NS 
5.1] 4.6) 4.9) 


own ai ie ae ae ie ie ie ie p | 


variable, type of enlistment, was probably 
artifactual, as will appear later. 
Because lengik of internment, which is 
essentially a situational variable, was so high 
on the list of predictors, it seemed that a purer 
measure of the effect of the other variables 
might be obtained by holding it constant. 
We therefore re-analyzed a number of variables 
within the subsample of 457 men interned 31 
months and over (i.e., those captured in the 
first half-year of the war). Table 3 shows the 
results of this re-analysis. For no variable 
is the trend changed, and for the majority of 
variables the magnitude of the relationship 
is somewhat increased. Education, in particu- 
lar, now becomes statistically significant. 
Moreover, for the first time, there appears a 
significant difference between resisters and 
collaborators (not shown in the table) on one 
variable, age. The resisters are older, on the 
average, than the collaborators. All the etas, 
however, are still quite small and of no practi- 
cal value. Scores on the authoritarianism 
scale were not re-analyzed because of its 
failure to discriminate in the total sample. 
Re-analysis of the one significant categorical 
variable (enlistment type) in the restricted 
sample proved to be trivial, because only one 
per cent of the long-interned enlisted men were 
draftees, making the use of chi square both 
impossible and superfluous. The apparent 
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difference between enlistment types on the 
criterion is thus easily explained by the longer 
average internment of regular army men than 
of draftees. 

A number of intercorrelations were found 
among the predictors. The intellectual vari- 
ables tended to be positively related both to 
age and length of service (themselves cor- 
related, of course), and to the Pd scale of the 
MMPI. However, a covariance analysis 
indicated that the Pd scale still discriminates 
almost as well when any of the intellectual 
variables is held constant. The information 
score also remains significant when age is 
held constant. In view of the small predictive 
power of all the variables, a more complete 
analysis of their interrelations did not seem 
worthwhile. 


DISCUSSION 


Perhaps the most important finding in this 
investigation is negative—none of the variables 
tested predicted the criterion to any useful 
extent. Thus, our results argue against any 
simple explanation of resistance or collabora- 
tion in terms of easily measured background 
or personality variables. For practical purposes 
this study might best be summarized by saying 
that resisters, neutrals, and collaborators do 
not differ appreciably on any of the variables 
investigated. 

Particularly striking is the failure of our 
authoritarianism scale to discriminate the 
groups. Apparently, the kinds of personality 
dimensions measured by this instrument are 
not related to prison camp adjustment, unless 
one assumes that the repatriates were more 
guarded and inhibited in answering these 
items than MMPI items. As we stated pre- 
viously, we deliberately attempted to remove 
items that might arouse defensiveness, but we 
have no way of knowing the extent to which 
this effort was successful. 

Similarly, occupation, number of parents, 
location of home, rural-urban origin, and re- 
ligion failed to discriminate the groups, though 
it is possible that the failure to obtain signifi- 
cant differences may be due to the lesser 
power of the chi-square test. 

_ Nevertheless, a number of statistically 
significant differences were found, and theoret- 
ically these pose interesting problems. As may 
be recalled, it had been our hunch that col- 
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laboration might be related to opportunistic 
disregard of authority. We hoped to measure 
this factor by means of the MMPI Pd scale 
and obtain the expected relationship. But why 
should resisters also have come out high on 
this scale? This finding suggests that we may 
have to interpret the Pd scale somewhat 
differently. The scale was standardized on 
psychopathic deviates, who are characterized 
by several major adjustment tendencies such 
as difficulty with authority figures, delin- 
quency, opportunism, and under-inhibition of 
impulses. In the absence of other information 
it is difficult to rule out any of the above tend- 
encies as explanations for behavior in prison 
camp. However, it seems reasonable to suppose 
that since both resisters and collaborators were 
high on Pd, it is the under-inhibition of im- 
pulses that is common to these two groups. 

Those men who were not able to inhibit 
impulses when under stress may have been 
forced by their own acting-out tendencies into 
taking some kind of stand with respect to 
Chinese indoctrination pressures: they either 
gave in to them or arbitrarily resisted them. 

It will also be recalled that both resisters 
and collaborators tended to be older, better 
educated, more intelligent, and more exper- 
enced in the army. This combination of factors 
suggests that “self-confidence” may have been 
an important element in determining whether 
a man would take some action: either col- 
laboration or resistance. If this explanation 1S 
correct one would also expect the extreme 
groups to have higher rank. In fact, the re- 
sisters and collaborators did tend to have 
higher rank, but the relationship did not 
reach statistical significance. 

These two factors, acting-out tendencies and 
self-confidence, suggest that the underlying 
behavioral continuum to which the various 
predictors are related is not resistance-collabora- 
tion but action-inaclion. Consistently, thé 
chief contrast is between the neutrals, wh? 
defended themselves with a wall of apathy 
(as described psychiatrically in 11), and the 
two other groups, both of which were charac- 
terized by some positive action toward the 
Communists. This action might take the form 
of open resistance, of pseudo-collaboratio? 
which it was vainly hoped could be used as a 
undercover means of resistance, or of genuine 
collaboration regarded as the best way ° 
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dealing with an otherwise hopeless situation. 
TI hose men who were low in self-confidence or 
in acting-out tendencies would be more 
hesitant to commit themselves either to open 
resistance or to collaboration, but would tend 
to follow the passive middle road of the ma- 
jority. 

A recently completed study of former 
prisoners of war (9) indicates that a higher 
percentage of resisters and collaborators than 
of neutrals had been entertainers, were 
athletically inclined, and were married. If 
these findings can be interpreted as reflecting 
tendencies to respond actively rather than 
passively to situations, they support our 
interpretation. 

The degree to which ideological change 
was the basis of collaboration is difficult to 
assess, because none of the tests used here 
measured such a variable, nor was there 
evidence concerning a man’s political attitudes 
Prior to capture. Tt should be noted that there 
1S no necessary relationship between collabora- 
tion and ideological change. It is possible that 
ideological change preceded and caused col- 
laboration, or that it followed as a consequence 
of collaboration, or that the two were entirely 
unrelated, or that both are a function of some 
other determinant. Furthermore, data from 
interviews obtained by the senior author (10) 
suggest that very few, if any, of the men under- 
went a significant degree of ideological change. 
Our results are consistent with the interpre- 
tation that ideological change was not an 
important determinant of prison camp be- 
havior. Instead, it appears that the tendency 
of a person to deal with a stressful situation 
either actively or passively is a more important 
determinant of his course of action. 


SUMMARY 


A sample of 759 American army prisoners 
of war was given a series of biographical 
questions, intelligence scales, and personality 
inventories shortly after their repatriation 
from communist prison camps in Korea. 
Comparisons were then made among those 
men who had (a) collaborated with the Com- 
munists, (b) actively resisted them, and (c) 
taken a “neutral” course between these two 
extremes. Resisters and collaborators were 
both significantly higher than neutrals in 


length of internment, length of service, age, 
intelligence, and the Pd scale of the MMPI, 
but resisters and collaborators did not differ 
significantly from one another. No significant 
differences among the groups were found in 
army rank, civilian occupation, religion, 
location of home community, or number of 
parents present in the home. The A and R 
scales from the MMPI and a questionnaire 
designed to measure “authoritarian” attitudes 
also failed to discriminate. Although differences 
obtained were too small to be of any practical 
value, they are of theoretical interest and were 
interpreted in terms of an activity-passivity 
hypothesis concerning response to stress. 
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in social motivation on a reaction 
time task. It was found that biological moti- 
vation aroused by means of shocks to a 
reacting finger tended to eliminate the charac- 
teristic slowness and variability of schizo- 
phrenic performance. Some of the more 
disorganized schizophrenics, however, were 
unable to improve significantly even under 
shock motivation. The presence of this lessened 
biological reactivity in some of the patients 
led to the rejection of the postulate of a unitary 
deficit in the social motivation’ of schizo- 
phrenics and to the consideration of a multiple 
deficit conception of schizophrenia. Specifi- 
cally, it was hypothesized that lessened 
biological reactivity constitutes a dimension 
of schizophrenic motivation that is relatively 
independent of the previously reported deficit 
in social motivation. 

The present experiment was designed to 
explore this hypothesis further by investigating 
the relationship of age and privilege status to 
reaction time under shock motivation, If it 
could be shown that slow reactions to shock 
are related to age and not to poor socialization, 
defined as a lack of ground privileges, then 
defective biological motivation would be 
partially identified as a relatively independent 
dimension of schizophrenic deficit, Similarly, 
the independence of the dimension of defective 
social motivation would be indicated by a 
relationship between privilege status and 
reactions to social stimulation, essentially 
unaffected by age. Finally, it was hypothesized 
that young, privileged patients may not 
exhibit deficit on either of these dimensions, 
but may be differentiated from normals by 
displaying defective reactivity to unpredictable 


stress as a reflection of disabling anxiety 
motivation. 


PREVIOUS experiment (9) presented 
A evidence of a schizophrenic deficit 


1 This study was supported by a grant from the 
Research Division, ichigan Department of Mental 


2 Now at Michigan State University, 
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In order to test these hypotheses, schizo- 
phrenic subjects (Ss), subdivided for age and 
privilege status, and a group of normal Ss were 
tested on a reaction time task in which they 
were required to lift a finger from a telegraph 
key in response to a buzzer. All Sswere tested 
under shock, anxiety, and social motivation 
conditions. In addition, the Henmon-Nelson, a 
group paper-and-pencil intelligence test (4), 
was administered to all Ss to obtain some 
indication of their current mental status. 
Since Shakow (11) and others (10) have 
suggested that paper-and-pencil intelligence 
tests indicate cooperativeness and the ability 
to sustain attention when used with schizo- 
phrenics, the Henmon-Nelson scores were 
employed as an index of general mental 
efficiency in assessing the construct validity of 
the reaction time measures of motivation: 
The reaction time and Henmon-Nelson tests 
were later readministered to the patients to 
determine the reliability of these indices 
Within a schizophrenic population. It was 
hypothesized that the reaction time scores 
obtained under each of the motivational 
conditions would be significantly related to age, 
privilege Status, and the normal-patient 
classification in the manner indicated. 


Metuop 
Subjects 


Forty female schizophrenic patients from Northville 
State Hospital were employed as the experimental f , 
since females had shown the least improvement W1 o 
shock motivation in the previous experiment. Ta 
conditions of age and privilege status were arranged 
a factorial design by subdividing 20 patients over e 
years of age and 20 under 40 years into two ogn 
groups, each consisting of 10 patients with aie 
Privileges and 10 without privileges. Ten college ae 
dents from Wayne State University, five men and > 
women, served as normal controls. Since young ree 
dents, particularly males, are likely to perform bE 

an schizophrenic females on a reaction time Sarn 
(9), this control group was employed primarily t 
obtain an index of optimal functioning against whic 
the schizophrenic groups could be compared. The mea? 
ages in years for the groups were as follows: unpriv! 
leged-old 49.7, Tange 43-56; privileged-old 51.2, range 
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41-59; unprivileged-young 28.7, range 18-37; privi- 
leged-young 28.8, range 23-35; normal controls 20.2. 
range 17-25. , 


Apparatus 


The reaction time device previously described (9) 
was modified so that the stimulus presentation, re- 
cording, and response units could be housed in a port- 
able wooden cabinet 2414 in. long, 12 in. wide, and 12 
in. high. The response unit consisted of a telegraph key 
and a copper plate mounted on a folding panel hinged 
to the S’s end of the cabinet. When the response panel 
was opened flat on a table, a 3 x 3 in. opal flash screen 
mounted on the front face of the cabinet was visible. 
The illumination of this screen by a 10-w. bulb inside 
the cabinet served as the ready signal. 

The ready signal was presented by experimenter’s 
(E’s) closing a push-button type switch. Depression 
of the telegraph key by S in response to the ready sig- 
nal activated a Hunter Decade Interval Timer and 
Closed a holding relay, permitting Æ to release his 
switch. At the termination of the interval, previously 
set on the interval timer by Z, a door-bell buzzer and 
Standard Electric Timer were instantaneously acti- 
vated. Release of the key in response to the buzzer 
broke the common circuit to the buzzer and the clock. 
Reaction times were read directly to the nearest 01 
sec. The clock, interval timer, and all switches were 
mounted on the rear face of the cabinet, and were 
accessible when a folding panel was opened. 

The shocks, generated by a Harvard Inductorium 
located inside the cabinet and wired in parallel with 
the buzzer and the clock, were administered through 
on the surface of the response key 
te on which S rested his hand. The 
inductorium was adjusted to transmit a 50-v. current, 
the setting of a silent mercury switch determining 


whether or not the shock would accompany the buzzer. 


Procedure 
All Ss were instructed that they were taking a re- 
action time test requiring them to depress the key i 
the light signal and to release it as rapidly as possib. e 
st series consisted of 


at the sound of the buzzer. The test $ l 
80 successive trials involving social, anxiety, and 


shock motivation presented in that order. 

Social motivation. The Ss were required to respond 
to the buzzer presented at a constant 10-sec. prepara- 
tory interval for 15 successive trials, followed by 10 
additional trials at this interval during which S was 
told his last reaction time and asked to improve his 
score. The mean of the first 15 reaction times served 
the individual social motivation scores. 

Anxiety motivation. The Ss were instructed to be 
prepared for long and short ready intervals as well as 
occasional shock, and then presented with 20 trials in 
which the 1- and 10-sec. intervals occurred equally 
often in a randomized order. Twenty more irregular 
trials in which shock was presented on three of the 
ten 1-sec. trials and three of the ten 10-sec. trials in 
ble fashion followed immediately. The 
n reaction times on the 1-sec. nonshock 

motivation score because 
ese trials would be most 


a copper electrode 
and the copper pla! 


an unpredictal 


mean of the seve! 
trials served as the anxiety 


it was anticipated that th 


likely to reflect disruptive anxiety in long reaction 
times. 

Shock motivation. The Ss were informed that they 
would feel skin stimulation at the same instant the 
buzzer sounded on all of the last 15 shock trials pre- 
sented at the regular 10-sec. interval. The mean of 
these 15 reaction times served as the shock motivation 
score, to be compared with the 15 trials of the test 
series given under social motivation. 

Form B of the Henmon-Nelson Test of Mental 
Ability for college students was administered to all 
patients during the same week that they were tested 
for reaction time. The patients were tested in small 
groups of less than 5 persons and aided in understand- 
ing the directions in order to maximize their perform- 
ance on the test. The reaction-time and Henmon- 
Nelson tests were readministered two to four weeks 
from the original testing to all patients then available. 
Of the total sample, five of the privileged-young, two 
of the unprivileged-young, and one of the unprivileged- 
old patients were unavailable. Henmon-Nelson scores 
were available on the normal controls from a class- 
room administration; no retest was conducted. 


RESULTS 


The effects of the three conditions of 
motivation upon reaction time in the schizo- 
phrenic and normal groups are presented in 
Table 1. Since a few extreme cases in the 
unprivileged-old schizophrenic group tended to 
influence the group means unduly, medians 
are also shown in the tables to permit an 
evaluation of the group trends relatively 
unaffected by extreme scores. Statistical 
analyses of the group differences have also 
taken the heterogeneity of this group into 
consideration, square root transformations 
of scores being employed where the group 
variances were not homogeneous. While the 


TABLE 1 
REACTION Ties IN .01 Sec. or SCHIZOPHRENIC AND 
NORMAL GROUPS UNDER DIFFERENT CONDITIONS 
or MOTIVATION 


Unprivileged Privileged 
Schizophrenics Schizophrenics 
Old Young Old Young Normals 

Shock Motivation 
Mean 19.2 15.1 17.0 13.7 12.0 
Mdn. 18.5 14.0 16:5 13.5 12.0 
SD 62 4,2 44 2.3 1.6 
Social Motivation 
Mean 146.8 38.3 35.6 23.1 18.8 
Mdn. 84.5 34.5 33:0 22:5 18.0 
SD 161.1 20.0 16.3 16.8 2.6 
Anxiety Motivation 
Mean 124.7 51.5 73.5 32.7 24.8 
Mdn. 96.0 45.0 60.0 32.5 24.5 
SD 99.8 26.7 41.6 9.4 5.5 
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normal group was not comparable to the 
schizophrenic groups in age or sex composition, 
their average scores are shown in the tables, 
and comparisons have been made where it 
was intended to match schizophrenic Ss against 
an optimal level of normal functioning. 

Shock motivation. The hypothesis that 
older schizophrenics show a reduced capacity 
to react rapidly to biological shock motivation 
appears to be clearly supported by the data 
shown in the first two rows of Table 1. Anal- 
ysis of variance of these scores revealed that 
age was a significant source of variance at the 
-05 level, while privilege status did not signifi- 
cantly seem to affect reaction time. There 
was no significant interaction between age and 
privilege status. A separate £ test showed that 
the normals were not significantly faster than 
the privileged-young patients. 

Social motivation. The middle two rows of 
Table 1 show that the older patients and those 
without privileges reacted more slowly under 
social motivation. Analysis of variance indi- 
cated that both these sources of variance were 
significant at the .05 level; their interaction 
was not statistically significant. 

Although these findings are not consistent 
with the hypothesis concerning the social 
motivation dimension, the data of both the 
present and the previous studies showed that 
the patients who could not react rapidly to 
shock were also unable to react rapidly to 
social instructions. Therefore, an index of 
social motivation was computed for every § 
which takes into account his biological capacity 
to react as indicated by reaction time under 
shock. 

These social efficiency percentages, based 
on the ratio between speed under social 
motivation to speed under shock motivation, 
are presented in Table 2. The analysis of 
these data revealed that the privileged patients 


TABLE 2 


SocraL EFFICIENCY PERCENTAGES oF SCHIZOPHRENIC 
AND NORMAL GROUPS 
= Unprivileged Privileged o 
Schizophrenics Schizophrenics 
Old Young Old Young Normals 
Mean 31.2 45.6 56.3 60.0 63.0 
Mdn. 26.0 45.0 49.0 59.5 63.5 
SD 17.5 16.7 27.9 8.6 7.5 
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TABLE 3 
HENMON-NELSON Test SCORES OF SCHIZOPHRENIC 
AND NORMAL GROUPS 


Unprivileged Privileged 
Schizophrenics Schizophrenics 
Old Young Old Young Normals 
Mean 7.2 16.6 14.0 27.5 47.1 
Mdn. 6.0 18.0 12.5 22.5 46.5 
SD 5.3 11.2 10.6 12.6 7.4 


obtained significantly higher percentages than 
the unprivileged patients, but that age did not 
significantly affect these scores. All the normals 
and 8 of the 10 privileged-young patients 
showed greater than 50 per cent social effi- 
ciency: these two groups were not significantly 
different on this measure. These results are in 
line with the social motivation hypothesis. 

Anxiely motivation. In keeping with the 
previous findings, it may be seen from the 
last two rows of Table 1 that the groups re- 
acting slowly to shock and social motivation 
were also slowest under anxiety motivation. 
Age was a significant source of variance at the 
-01 level, while privilege status and the 
interaction were not significant. Furthermore, 
the privileged-young patients, who reacted at 
the normal level under shock and_ social 
motivation, were Significantly slower under 
anxiety motivation than the normal group 
(.05 level). 

Intelligence test and correlation dala. ‘The 
current mental functioning of the groups 15 
indicated by the Henmon-Nelson scores, 
shown in Table 3. Since this form of the Hen- 
mon-Nelson test was standardized for college 
students and required concentration in follow- 
ing detailed clerical directions, the scores 
obtained by the patients might be expected tO 
reflect both the social Motivation to cooperate 
and the capacity to sustain attention. Analyses 
of these data revealed that youth, privilege 
Status, and membership in the normal grouP 
Were all significantly associated with highe! 
scores, 

The relationship of these mental efficiency 
Scores to the reaction time indices of motiva- 
tion is shown in the intercorrelation matrix 
Presented as Table 4. The correlations indicate 
that Henmon-Nelson scores were significantly 
related to all three measures. Shock and 
anxiety reaction times were also significantly 
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related; but social efficiency percentages were 
not significantly related to either of the other 
reaction time measures. Furthermore, it may 
be seen from this table that all these indices 
had useful retest reliabilities for measurement 
purposes; shock reaction time and Henmon- 
Nelson scores being highly stable. 
mS eee of the fact that the Henmon-Nelson 
Pe pf sree to provide an index of general 
ba efficiency significantly related to the 
ee measures, a multiple correlation coeffi- 
bar was computed to determine the amount 
ariance in Henmon-Nelson scores that 
could be accounted for by all three reaction- 
pi scores. This multiple R was equal to .57, 
= ounting for 33 per cent of the variance in 
tenmon-Nelson scores. The multiple regres- 
Slon equation was: 


Y’ = 20.17 — .74X, + .24X2 — 02X; 


where 
es r 
Y’ = predicted Henmon-Nelson score 
X1 = shock reaction time 
ce 2 = social efficiency percentage 
3 = anxiety reaction time 


It is apparent from the regression weights 
and the beta coefficients obtained that most 
of the variance in mental efficiency in this 
Sample of schizophrenics was attributable 
to the shock motivation score (12 per cent) 
and the social efficiency percentage (17 per 
cent), Anxiety reaction time, apart from that 
Component related to shock reaction time, 
contributed less than four jer cent to these 
Schizophrenic mental efficiency scores; but 


TABLE 4 
Rertasmrres AND INTERCORRELATIONS or REACTION 
Tam AND HENMON-NELSON SCORES IN 
ScH1zOPHRENICS* 


social] Anxi- | yten- 


Shock reaction time an 79 


Social effici 

z ciency percentage sr . 

Anxiety reaction time ‘64|—.26] -14 
enmon-Nelson score —.39 


son r’s based on the scores 
itial test and the retest. 
r’s based on initial test 
f 31 is required 


af Bpa reliabilities presented are Pears 

the 32 patients available for both the in! 

see Temaining intercorrelations are Pearson 

for pas of all 40 patients. For 38 df a Pearson r 0 
e .05 level of confidence. 
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would probably contribute appreciably to the 
variance of a distribution composed only of 


early schizophrenic scores. 
Discussion 


The results of the present experiment suggest 
that three relatively separate dimensions of 
motivational deficit may be identified among 
the schizophrenias. There seems to be a class 
of schizophrenics who show a defective capac- 
ity to respond to biological stimulation in 
addition to the more typical reduction in 
social motivation. These patients tend to 
be older and to have poorer mental efficiency 
but, within the limits of their reduced bio- 
logical reactivity, they are as socially efficient 
as other schizophrenics. Such patients may 
well show the quantitatively subnormal 
adrenal responses to stress observed by Hoag- 
land (5) in chronic schizophrenics. It would 
also be profitable to investigate whether such 
psychotics display a significant number of 
organic signs on the Rorschach. An expectation 
of this sort is supported by the finding of 
Brackbill and Fine (2) that typical process 
schizophrenics are indistinguishable from 
organics in the incidence of Piotrowski’s ten 
signs of organic involvement. 

A second class of schizophrenics seems 
unimpaired in biological motivation, but the 
reactivity of these patients to social incentives 
is typically well under 50 per cent of their 
reactivity to shock. The extent of the deficit 
in social motivation appears unrelated to age 
but is reflected in a lack of ground privileges 
and poorer mental efficiency. This group illus- 
trates the social disarticulation process de- 
scribed by Cameron (3) and others (12) in the 
typical disturbances in reaction time reported 
originally by the Worcester group (6, 8) and 
more recently by King (7). King, however, 
has rejected the motivational interpretation 
of slowed reaction times, suggesting rather 
that this symptom is “. . . a primary indication 
of disturbance in a basic adaptational process, 
that of motor adjustment to the external 
surround.” The findings of both the present 
and previous papers that shock motivation 
eliminates this primary disturbance in motor 
adjustment, at least in some patients, makes 
his interpretation very questionable. 

A third class of schizophrenics functions at 
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a normal level of efficiency under both bio- 
logical and social motivation. These patients 
are young, have ground privileges, and tend 
to display good mental efficiency. T hey seem 
to display abnormal reaction times only under 
the stress of anxiety motivation, suggesting a 
type of psychosis in which the disorder con- 
stitutes primarily a reaction to disruptive 
anxiety. While the intellectual regression of the 
first class of schizophrenics appears to resemble 
Beck’s (1) reaction Form S-1, the intellectual 
control and lack of symptoms of this third 
class may well involve the defensive character 
hardening that he has described in his Form 
S-3. 

The present findings do not indicate whether 
these three types may best be regarded as 
different forms of schizophrenia or as stages 
of the same schizophrenic process. However, 
the reaction-time technique employed appears 
to provide a brief, reliable instrument for 
assessing relatively fundamental pathology 
in motivation. It seems likely that Prognosis 
is poorer in those patients showing a deficit 
in biological motivation, better in those with 
good biological motivation and high social 
efficiency percentages. Further exploration of 
the manner in which these dimensions of 
motivation are related to different prognostic 
and etiological factors should determine 
whether the types isolated require classification 
as separate forms of psychopathology or as 
various phases of a basic schizophrenic process, 


SUMMARY 


The present experiment was designed to 
investigate the relationship of age and ground 
privilege status to reaction time indices of 
biological, social, and anxiety motivation in 
schizophrenic patients. 

Two conditions of both age and privilege 
status were arranged in a 2 x 2 factorial 
design by subdividing 20 schizophrenics over 
40 years and 20 under 40 years into two equal 
groups, each consisting of 10 privilege and 10 
ward patients. Ten college students served as 
normal controls. 

All Ss were tested on a reaction time task 
requiring them to lift a finger from a telegraph 
key in response to a buzzer during 80 successive 
trials, successively involving social instruc- 
tions, unpredictable anxiety cues, and shock 


stimulation. The Ss were also given the Hen- 
mon-Nelson test of Mental Ability and retested 
on both the reaction time and intelligence 
tests. 

The results showed that age significantly 
slowed mean biological and anxiety reaction 
times but did not affect the social efficiency 
percentage based on the ratio of speed under 


Social motivation to speed under shock 


motivation. Privilege status significantly im- 
Proved mean social reaction time and social 
efficiency percentage but had no effect on the 
other two measures and did not interact with 
age. The normals were superior to the schizo- 
phrenics on all measures, but could be signifi- 
cantly differentiated from the younger, 
privileged group only on anxiety reaction time. 

Correlational data were presented showing 
that shock reaction time was correlated with 
anxiety reaction time but independent of social 
efficiency percentage in the schizophrenic 
sample. Mental efficiency was significantly 
correlated with the three reaction time indices, 
and all four measures had stable retest relia- 
bilities. 

It was concluded that lessened reactivity 
to biological stress with increasing age, reduced 
responsiveness to social demands, and inability 
to control anxiety are three separate dimen- 
sions of motivational deficit which may be 
isolated in schizophrenic patients. It is sug- 
gested that these dimensions may be related 
to different etiological and prognostic factors, 
and may prove useful in the classification of 
schizophrenic pathology. 
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THE EFFECTS OF SEX-ROLE IDENTIF ICATION 
UPON PROBLEM-SOLVING SKILL: 


G. A. MILTON 
Stanford University 


URING recent years, there has been an 
increased interest in the effects of 
“nonintellectual” Processes upon 

problem solving and thinking. One area in 
which such effects can be and have been 
studied with readily discriminable criterion 
groups is the area of sex differences, Sweeney 
(5) has summarized several studies in which 
problem-solving differences between men and 
women were observed. He also reports experi- 
ments of his own which demonstrate that men 
solve certain classes of problems with greater 
facility than do women, even when differences 
in intellectual aptitude, academic training, 
and special abilities are controlled. 

The present study explores the possibility 
that the differences in problem-solving skill 
between men and women may be due, at 
least in part, to a set of learned behaviors 
that constitute a culturally defined sex role. 
A simple extrapolation from the earlier studies 
of sex differences leads to the hypothesis that 
the more an individual identifies with the 
masculine sex role, the greater will be his 
problem-solving skill. 

This hypothesis may be stated more ex- 
plicitly: (a) There is a positive relationship 
between masculine identification and problem- 
solving achievement. (6) When an adjustment 
is made for the between-subjects variance 
contributed by sex-role identification, sex 
differences in problem-solving achievement 
will be reduced. The implication of the hypoth- 
esis is, of course, that not only will differences 
between men and women in problem-solving 
skill be reduced by an adjustment for mas- 
culine role identification, but also that the 

more “masculine” men will solve problems 
more readily than “feminine” men and, 
similarly, that the more masculine women 


i riment is one in a series of studies of 
e ue under Project NR 150-149 and 
r ted by Contract N6onr 25125 between Stanford 
eaii and the Office of Naval Research. Work on 
E se was under the general direction of Dr. 
Id W. Taylor. Permission is granted for reproduc- 

aa analai publication, use, and disposal of this 


article in whole or in part by or for the United States 


Government. 


will be better problem solvers than women 
who have greater identification with the 
feminine role, 


MerTHOD 
Indices of Sex-Role I. dentification 


The concept of sex role refers to a heterogeneous 
collection of actions that differentiate typically mascu- 
line behavior from typically feminine behavior in a 
particular culture. These characteristics of the sex 
role may be defined restrictively in terms of overt 
behavior only, or more broadly in terms of attitudes, 
interests, needs, and other inferred constructs in addi- 
tion to overt behavior. The indices of sex role used in 
this investigation have been determined by primarily 
empirical means; overt and covert measures were used, 
the criterion for selection being that they involved 
nonintellectual factors that differentiate men from 
women for the general culture from which the experi- 
mental sample was drawn. 

The Terman-Miles M-F Test (6) was chosen as the 
primary defining measure of sex-role identification 
because it measures a broader range of responses than 
any other sex-role measure cited in the literature. 
This test consists of a set of items that differentiate 
men from women in a culture similar to that of the 
current sample (cf. below). The test elicits responses 
that not only discriminate men from women, but also 
seem to assess a culturally defined masculinity-feminin 
ity variable within each sex. Ae 

Two secondary measures of sex role were also oie 
ployed. These tests tap a narrower range of errs 
content. One, the M-F scale of the Minnesota Mw a 
phasic Personality Inventory (2), consists largely a 
self-report items referring to feelings and emotion: 
The other, the Behavioral Inventory M-F (4) scale, t 


part of an unpublished experimental piee Be 


defines the role in terms of self-reported overt be lege 
that discriminates between men and women in a C0 
Population, 


Problems 


* ex 
Twenty problems that had shown signifi me a 
differences in pretest trials were used. The ee pis 
were selected and refined by Nakamura (3), an the 
methods of administration were duplicated = the 
Present study. Item analysis has indicated tha skill, 
problems involve two types of problem-solving ai 
restructuring and straightforward solution. Iter bis 
structuring problems require the subject to a aight- 
initial set in order to attain the solution; the str Ten 
forward problems are solvable by direct meane: were 
straightforward and ten restructuring problems a o 
arranged alternately by pairs, according to © umer- 
difficulty. One member of each pair involved a iea x 
ical solution and the other member was non-num 
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The problems were bound in a booklet with one prob- 
lem per page so that each problem was presented 
individually. A working time of four minutes per 
problem was allowed with a ten-minute intermission 
after the first ten problems. Problems were scored 
either correct or incorrect with no partial credits given. 


Subjects 


i iene hundred and twenty-nine students from an 
im roductory psychology class at Stanford participated 
as subjects. Of these, 63 were males and 66 were females. 
Scores on the mathematical and verbal sections of the 
College Entrance Examination Board Scholastic Apti- 
tude Test were available for all subjects as a control 
upon intelligence. 


Procedure 


_ Subjects were tested in six groups of 16 to 35 sub- 
Jects. For all groups, the tests were administered in 
three sessions. The order of presentation of the tests 
was arbitrary. The MMPI M-F and the Behavioral 
Scale M-F were presented during the first session; the 
problem-solving test during the second session, and the 
Terman-Miles M-F during the third session. Because 
of the nature of the items on the M-F scales, it was 
assumed that there would be no interaction with order 
of presentation; therefore, this arrangement was made 
purely on the basis of experimental expediency. 


RESULTS 

The results of this investigation may be 
divided into two categories. First, sex dif- 
ferences on the problem-solving measure are 
examined in order to establish that those 
reported by other investigators (5) are found 
in the current investigation. The hypothesized 
relationship between sex role and problem 
solving is then tested. Both the relationships 
between sexes and within sexes are examined. 


Sex Differences in Problem Solving 


The differences between the problem- 
Solving achievement of men and women 
Presented in Table 1 are the data which the 
hypothesis must partially explain. The differ- 
ences between men and women are significant 
with respect to the total problem-solving 
score and all the sub-scores, but the difference 
between men and women on one subtest is not 
significantly greater than that on another 
subtest. The total score can therefore be used 
to test the effects of the sex-role measure. 
These results are similar to those obtained by 
Nakamura (3) for the same problem on a 
sample of University of California students. 
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TABLE 1 
Sex DIFFERENCES IN PROBLEM-SOLVING SKILL 


Men Women 


Type of Problem — a AEE 
Mean} SD |Mean| SD 


Problem-solving, total 9.10/3.68]7.18}3.27/3.12** 
score 


Restructuring problems [4.33]2.22/3.38/1.88 2.63" 
Straight-forward problems |4.76|2. 17/3 .80|1.79 2.74** 
Numerical problems 5.06]2.27/4.04/1.87/2.78** 
Non-numerical problems [4-05 2.15}3.14 1.79|2.59* 


* Significant beyond .02 level. 
** Significant beyond .01 level. 


TABLE 2 
CORRELATION OF THE SEX-ROLE AND APTITUDE 
VARIABLES WITH PROBLEM-SOLVING SKILL 


Variable Men Women Combined 
Terman-Miles M-F .44** .26* ALt* 
MMPI M-F 14 13 29" 
Behavior M-F E 23 .36"* 
Verbal aptitude 4ax* aar 

63** 78** 


Math aptitude 


* Significant beyond .05 level. 
** Significant beyond .01 level. 


Effects of Sex-Role upon Sex Differences in 

Problem Solving 

In order to confirm the hypothesis of this 
investigation, two effects must be demon- 
strated: that there is a correlation between 
sex-role identification and problem solving, 
and that this correlation accounts for some 
of the differences between men and women 
in problem-solving skill. The correlations 
between sex-role measures and problem solv- 
ing are presented in Table 2. 

Tt is clear that problem-solving skill is 
significantly related to sex-role identification 
as measured by the Terman-Miles test. There 
is a significant relationship both within and 
across sexes, The evidence concerning the 
secondary measures of sex-role is less clear. 
The correlation of problem solving with the 
MMPI M-F scale approaches significance 
only for the combined sex group, and the 
correlation of problem solving with the 
Behavioral M-F scale does not appear to be 
significant for the women. 

In order to determine if the differences 
between men and women in problem-solving 
score can be accounted for in terms of score 
on the measures of sex-role identification, a 
simple analysis of covariance was performed. 
The rationale of this operation is that if the 
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TABLE 3 s 
ANALYSIS OF COVARIANCE: SEX DIFFERENCES IN 


PROBLEM SOLVING WITH ADJUSTMENT FOR 
TERMAN-MILES M-F Score 


G..A. MILTON 


TABLE 6 


BETA WEIGHTS: RELATIVE INDEPENDENT CONTRIBU- 


TION OF EACH OF THE VARIABLES TO 
PROBLEM-SOLVING SCORE 


A STE After Adjustment 
Source of 
VANEK Sum of Sum of | Mean 
af Squares if Squares Squares F 
Total 128 | 1675 | 127 | 1375 
Within 127 | 1557 | 126 | 1366 | 10.84 
Between 1] 118 1 | 9.32 | 9.32 | 0.86 
TABLE 4 


ANALYSIS OF COVARIANCE: SEX DIFFERENCES IN 
PROBLEM SOLVING WITH ADJUSTMENT 
FOR MMPI M-F Score 


Batare After adjustment 
Source of Adjustment 
Variation a Tama Sum of| Mean 
if Squares Squares | Squares 
Total 128 | 1675 | 127 | 1534 
Within 127 | 1557 | 126 | 1529 | 12.13 
Between 1] 118 1 | 5.86 | 5.86 | 0.48 
TABLE 5 


ANALYSIS OF COVARIANCE: SEX DIFFERENCES IN 
PROBLEM SOLVING WITH ADJUSTMENT FOR 
BEHAVIORAL M-F Score 


e i After Adjustment 
S f Sı f| M 
af Sauazes af Squares Satare F 
tal 128 | 1675 | 127 | 1462 
Sithin 127 | 1557 | 126 | 1454 | 11.54 
Between 1| 118| 117.76| 7.76 | 0.67 


correlation between the problem-solving score 
and the sex-role measure is sufficient to reduce 
ificantly the difference in problem-solving 
score, then sex-role identification as measured 
by the particular test accounts for a part of 
this difference. Table 3 presents the results of 
this analysis for the Terman-Miles test, and 
Tables 4 and 5 show similar results for the 
MMPI and the Behavioral Inventory. 
The results indicate that the Terman-Miles 
not only accounts for a significant part of 
the difference between men and women in 
problem-solving skill, but also that it di- 
minishes this difference to the point where 
it is no longer significant. The results with 
MMPI and Behavioral Inventory are 
— rable to those obtained with the 
ran Miles The difference between men 


sign 


Men Women 
Variable (V = 63) (N = 66) 
Beta Beta 
Terman-Miles M-F E da .183* 
MMPI M-F -074 .050 
Behavior M-F -066 -126 
Verbal aptitude .224** 099 
Math aptitude .473** .848** 


* Significant beyond .05 level, 
** Significant beyond .01 level. 


and women on the problem-solving measure 
diminishes when the variance attributable 
to sex-role identification is controlled. 

The hypothesis requires, by implication, 
that differences in problem-solving skill within 
a sex must also be partially accounted for by 
differences in sex-role identification. Table 2 
indicates that the Terman-Miles measure was 
significantly correlated with problem solving 
within both sexes, but that the other sex-role 
measures were less clearly related to problem 
solving. 

In order to determine what portion of the 
variance in the problem-solving task within @ 
sex may be explained on the basis of sex-role 
identification and the other nonintellectu® 
variables related to sex role, the vana an 
were fitted to a multiple regression equate 
and the appropriate beta weights were dete i 
mined for each variable by means of @ ee 
little solution. Since the hypothesis requiro 
that differences in problem-solving es 
must be explained independently of differen a 
in intellectual aptitude, the verbal and mat a 
matical measures were included in the ann 
tion. The results of this analysis are prese” 


in Table 6. the 
For both the men and the women, nti- 
-role ide 


Terman-Miles test measure of sex 
fication is a significant contributor to eh 
variance in problem-solving peer are 
when mathematical and verbal aptitu “ude 
held constant. The mathematical apt nce 
scale contributes a large portion of the sag? ale 
for both sexes, and the verbal aptitude ales: 
makes a significant contribution for the inde- 
The other sex-role variables make = : ob 
pendent contribution to the variance 1 onna 
lem solving, when the more powerfu 

Miles test is included in the battery- 


SEX-ROLE IDENTIFICATION AND PROBLEM SOLVING 


DISCUSSION 


The results of this study seem to indicate 
that the differences between men and women 
in problem-solving skill as reported by a 
number of investigators are merely part of a 
more general difference between men and 
women in role identification. Men have 
characteristically different behavior, attitudes, 
emotions, and motivations than do women, 
and the differences in problem-solving skill 
reflect or are a part of these differences. To 
the extent that allowance can be made for 
these differences in general sex-role identi- 
fication, the differences in problem-solving 
achievement seem to disappear. 

Moreover, there are differences within a 
sex as to the extent of identification with the 
appropriate sex role, and these within-sex 
differences in role also seem to be related to 
problem-solving skill. Men who identify 
with the masculine sex role are better problem 
solvers than are those who identify with a 
more feminine role, and conversely, women 
who identify with the appropriate sex role 
have poorer problem-solving scores than do 
women who have a more masculine identi- 
fication. All these comparisons, of course, 
assume a common level of intelligence. 

This interpretation leads to several impli- 
cations. For example, the acquisition of 
problem-solving skills may be affected by the 
same processes which govern the formation of 
sex-role identification. Tf the Freudian hy- 
pothesis concerning the identification process 
is correct, then important antecedents of 
adult problem-solving skill are to be found 
in childhood. The male child who forms an 
adequate identification with the male role, 
given sufficient intelligence and opportunity 
to learn, will grow up to be a good problem 
solver. On the other hand, the female child, 
even though possessing adequate intelligence 
and opportunity to learn, will probably not 
develop problem-solving skills (of the sort used 
in this study) if she forms an appropriate 
identification with the feminine role, because 
this type of problem solving is not appropriate 
to the female sex-role in her culture. This is, 
of course, a very broad extrapolation from 


the data of this experiment. 

Another interesting implication concerns 
the possibility for improvement in problem- 
solving skill. If the sex-role interpretation of 
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the results of this experiment is correct, any 
attempt to change problem-solving skill will 
be more effective if corresponding changes 
are made in certain aspects of the conventional 
sex-role identification. While it may not be 
necessary to change the sex-role identification 
of a subject in order to change his (or her) 
problem-solving skill, it would seem necessary 
to neutralize the effects of sex-role identifica- 
tion that relate to the problem-solving process. 
Practice on problems alone probably would 
not be effective. Carey (1) has demonstrated 
an effect similar to that of partially changing 
the problem-solving attitudes associated with 
sex-role identification. Through group dis- 
cussions of women’s attitudes toward intel- 
lectual activity, she was able to improve their 
problem-solving scores. This change, though 
significant, was slight, and it would seem 
that more than a few discussions are required 
to change attitudes that appear to be basic 
to the culturally defined sex role. If one 
considers that sex-role identification has had 
an effect on the learning of a subject for the 
major part of a lifetime, it seems that a change 
in sex role would require a long period of 
relearning. Also, if major changes in problem- 
solving performance are to be achieved, the 
relearning of sex-role would necessarily be 
accompanied by a period of relearning of 
problem-solving skills. 

Although this interpretation of the results 
seems appropriate for a college population, 
there are logical difficulties in any extrapola- 
tion of these results to the more general 
population. Wherever there are subsections 
of the general population in which problem 
solving, as defined in this investigation, is 
not appropriate to the masculine role, this 
relationship would not be expected. Since the 
studies of sex differences in problem solving 
which were mentioned earlier have been 
conducted upon college populations, there is 
little evidence as to how far these conclusions 
can be generalized. 


SUMMARY 


The hypothesis of the study was that sex 
differences in problem-solving skill may be 
partially accounted for by differences in 
sex-role identification. The Terman-Miles 
M-F Test was used as the primary index of 
sex-role identification. Two other M-F ques- 


212 


tionnaires were also employed. These scales 
and a measure of problem-solving skill were 
administered to 63 male and 66 female uni- 
versity undergraduates. In general, the results 
indicate that there is a positive relationship 
between masculine sex-role identification and 
problem-solving skill both across sexes and 
within a sex. When allowance is made for 
this relationship, the difference between men 


and women in problem-solving performance 
is diminished. 


REFERENCES 


1, Carey, G. L. Reduction of sex differences in prob- 
lem solving by improvement of attitude through 
group discussion. Dept. Psychol., Stanford 


G. A. Mitton 


Univer., 1955, (ONR Tech. Rep. No. 9, Contract 
N6onr 25125). 

2. Haraway, S. R, & McKintey, J. C. Minnesota 
Multiphasic Personality Inventory. New York: 
Psychol. Corp., 1951. 

3. Naxaaora, C. Y. The relation between conformity 
and problem solving. Dept. Psychol., Stanford 
Univer., 1955. (ONR Tech. Rep. No. 11, Con- 
tract N6onr 25125). 

4, Pryneau, S. R., & Mton, G. A. The objective 
measurement of overt behavior. Paper read at 
Western Psychol. Ass., June, 1954. 

5. Sweeney, E. J. Sex differences in problem solving. 
Dept. Psychol., Stanford Univer., 1953, (ONR 
Tech. Rep. No. 1 Contract N6onr 25125). 

6. Terman, L. M., & Murzs, C. C. Sex and personality. 
New York: McGraw-Hill, 1936. 


Received October 24, 1956. 


THE EFFECTS OF CLEAR AND UNCLEAR ROLE EXPECTATIO 
, NS 
ON GROUP PRODUCTIVITY AND DEFENSIVENESS! 


EWART E. SMITH? 
University of Colorado 


time and energy trying to develop 
z sufficient understanding of other members 
in order to feel they can predict how these 
other members will behave. In role theory 
concepts (5), this is attempting to locate the 
position of the other so that one may have 
valid role expectations of the other’s role 
enactments. The same phenomena have been 
discussed in other theoretical terms. Bales (1) 
states that the social structure of groups 
develops in order to “ . . . reduce the tensions 
growing out of uncertainty and unpredicta- 
bility in the actions of others.” 
_ The general hypothesis of the present study 
is that the time and energy used by group 
members in attempting to predict the be- 
havior of others reduces the amount of group 
energy available for any given group goal or 
task. This hypothesis is consistent with the 
conception of group energy formulated by 
Cattell (2), who states that group members 
have a finite amount of energy available in a 
group situation, and that the amount of energy 
absorbed by the internal group structure is 
subtracted from the amount available for 
group productivity. , — 
The present investigation 1s primarily 
concerned with the relationship between 
the degree to which members of a small 
group feel they can predict each_other’s 


behavior, and the group’s task efficiency. 
In addition to the positive relationship an- 
variables, it was €x- 


ticipated between these r 
pected that uncertainty regarding the roles 
Id be reflected 


of the other group members wou 
in increased defensiveness and dissatisfaction 


with the group experience. 


1 Condensed from a dissertation submitted in partial 
fulfillment of the requirements for the Degree of Doctor 
of Philosophy in the Department of Psychology, Uni- 
versity of Colorado. The writer wishes to express his 
appreciation to Professor Jack R. Gibb, under whom 
the work was done, and to Professors Maurice P. Smith 
and Michael Wertheimer for their helpful criticisms 
and advice. The work was supported by a research 
Brant (Contract Nonr-1147(03), NR 170-226) from the 
Group Psychology Branch of the Office of Naval Re- 
Search. 

2 Now with the RAND Corporation. 


T ALL groups, people invest considerable 


While these problems have not previously 
been studied experimentally, as far as the 
author can ascertain, Torrance (8) developed 
similar views from his study of 1,000 critical 
incidents of Air Force crews downed in enemy 
territory. He concluded that the two major 
detrimental factors in group survival were 
an unclear situation and an unclear group 
structure. Regarding the latter factor, he 


states, in part: 


Under stress, these linkages between members may 
become confused and thus people do not have a clear 
perception of what they can expect from one another, 
with whom they can relate, how they can relate to one 
another, and so on. The fact that this happens appar- 
ently influences the ability of the group to survive. 


In order to induce ambiguous role expecta- 
tions, paid accomplices were assigned roles 
as silent members in small groups of persons 
who were, initially, strangers to each other. 
In the initial meeting of such a group, each 
member should find the behavior of all of 
the others equally unpredictable. But in the 
course of group interaction the behavior of 
those who spoke should become more pre- 
dictable, while the behavior of those who 
remained silent should remain unpredictable. 
Eventually the group members would un- 
doubtedly come to predict with confidence 
the simple fact that the silent members would 
remain silent. Since only short-term groups 
were studied, however, the latter effect is not 
relevant to the present experiment. 

This reasoning was substantiated in several 
preliminary investigations. The presence of 
silent group members in short-term groups 
had little, if any, effect on productivity 
during the first few minutes, but productivity 
decreased to a low point at about thirty min- 
utes, followed by a gain until the presence of 
silent members resulted in no decrement in 
productivity. 

The hypotheses tested in the present study 
are: 

1. The presence of silent members in a 
group produces (a) decreased productivity, 
(b) increased defensiveness, and (c) increased 
dissatisfaction with the group experience. 
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2. Decreasing the ambiguity of role ex- 
pectations in a group that contains silent 
members counteracts the effects predicted in 
Hypothesis 1. 


METHOD 
Subjects and Task 


The subjects (Ss) were 50 males and 90 females from 
the introductory psychology classes at the University 
of Colorado. The task, taken from an experiment by 
Taylor and Faust (7), is an adaptation of the parlor 
game, “Twenty Questions,” in which the Ss had to 
identify items designated by the experimenter (E) as 
either animal, vegetable, or mineral. Each problem had 
to be solved with less than forty questions for the group 
to receive credit. There was no time limit on individual 
problems. To each question asked by a group member, 
E replied in one of the following ways: _ 

(a) Yes; (b) No; (c) Partly; (4) Sometimes; (e) Not 
in the usual sense of the word; (f) I don’t know (no 
charge for the question); (g) Please restate the ques- 
tion (if the question was unclear or could not be an- 
swered in one of the above ways). Difficult items like 
wrench, ruby, and bread were used. 


Experimental Design 


In the experimental Conditions A and B, each 
group was composed of three participating Ss, two 
females and one male, and two silent members, one 
female and one male. Condition A was designed to 
induce ambiguous role expectations. Condition B 
differed from Condition A in that the roles of the silent 
members were clarified prior to experimentation; all 
group members checked their “usual role” in a group 
on a check list. Everyone then announced the role he 
had checked, which was then checked by E on a large 
replica of the “Usual Roles” check list. The silent 
members always checked the “Listener” role, 

In the control Conditions C and D, there were no 
paid silent members. In Condition C each group was 
composed of three participating Ss, two females and 
one male. In Condition D, each group was composed 
of five participating members, three females and two 
males. 

In all conditions, Ss worked on the task for two 
periods. The first work period, which lasted fifteen 
minutes, was intended to allow ambiguous role ex- 
pectations of the silent members to develop. In the 
second work period, which: was ten minutes long, 
group productivity was measured. The problems, and 
their order, were the same for all groups. Regardless 
of how many problems the group had solved in th 
first work period, each group started with a new à 
identical, set of problems in the second work perio T 

The thirteen silent paid partici petog, 

` Snt paid participants were not known 
to the Ss in their groups. Each sile: 


i eir nt member w; 
assigned to Cond a 
. oan ition A and B §roups an equal number 


Dependent Variable Measurements 


Group productivit 
problems solved. Dep measured by the number of 


d. Defensiveness was m 
3 É i: easured 
25-item Likert scale, with a corrected, piiblit tat 
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TABLE 1 


Mepian RATING OF Ss AND SILENT PAID 
PARTICIPANTS ON ROLE AMBIGUITY 


Data under each condition 


isti A B € D 
Statele Ambiguity (Silent| Clarity (Silent |Control Control 
Ambigu- embers) Members) (N = 3)|(N = 
ity Rating) 
+ Rating + Rating i Rating 

RAHAB lof Silent | Rating | of Silent | Rating | Ra 

of Ss [Members fembers| Of 58 | of ee 
Median 11.00 | 6.00 | 11.00 8.00 8.50 | 8.00 
N 29 | 29 30 30 30 50 


Note.—Low scores indicate role ambiguity. 
H for these data is 34.89, p < .01. 


N is 29in Condition A as one S failed to fill out the rating 
form. 


ability of .79, which had been validated by the known 
group method (4). A typical item is “I felt free to 
express all my ideas in this group.” Group satisfaction 
was measured by a 15-item Likert scale, with a cor- 
rected split-half reliability of .93, which had been 
validated by the known group method. A typical item 
is “This is the most stimulating experience I’ve had in 
a long time.” A single item was used as an index of 
role ambiguity: “I can predict how this person would 
act in other situations.” The Ss rated each grouP 
member on a scale from —3 to +3. A numbering syS~ 
tem was used so that Ss did not give their names, This 


measure was used as a check on the effectiveness of the 
experimental manipulation. 


RESULTS AND DISCUSSION? 


„Table 1 contains the findings on the am- 
biguity item, “I can predict how this perso” 
would act in other situations.” Plotting th¢ 
Scores on this item revealed that norma 
distributions could not be assumed. CoM 
sequently, nonparametric rank tests bave 
been used (3, 6). An H test, which is func- 
tionally equivalent to a one-way analysis © 
variance, has been computed on the data }P 
Table 1. As this H is statistically significant, 
additional rank tests have been compute: 
Using this method, 2’s are obtained which 
are normal deviates evaluated by the use ° 
the normal probability table. The 2 Fal 
related measures) on the ratings of a 
members compared to the ratings of Ss in tpe 
Condition A groups is 4.46**, indicating th 
the independent variable manipulation : 
Condition A was successful; i.e. that th 


. : CE3 5 
silent paid participants were seen as 
predictable than the Ss. 


uag D 
? Statistical significance at the .05 level has be 


oe er 
indicated by a single asterisk; at the .01 level or bett! 
by a double asterisk. 


ee eS 
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, Additional support for the hypothesis that 
silent members were seen as less predictable 
is given by the significant z, 4.37**, between 
the ratings of silent members and Ss in the 
Condition B groups. However, comparison 
of the ratings of the silent members in Condi- 
tion A with the ratings of the silent members 
in Condition B indicates that the difference, 
although in the predicted direction, is not 
significant. ‘Therefore, the hypothesis that 
role clarification reduces the ambiguity of 
the roles of silent members is not supported. 
eon failure may be due to a temporal 
sete, At the time that the Ss rated the 
ilent members on predictability, they had a 
a of 30 minutes of observations upon 
nn to base their ratings. This 30-min. 
et ge was the same in Conditions A 
rigs and was apparently more powerful 
no the earlier one-minute role clarification 
5 ondition B, perhaps resulting in a leveling 
` ect on the comparison of Conditions A and 
with respect to the ratings of the silent 
members. Results more in line with the 
hypothesis might have been obtained if the 
rating item had been given earlier. 
Pii interesting, and unpredicted, finding is 
hat the mean rating of the Ss in both Condi- 
tions A and B is significantly higher than the 
ratings of Ss in Condition C, with 2’s of 3.24** 
and 3,64**, using a rank test for independent 
samples (3). This finding cannot be explained 
from the data, but, as a possible explanation, 
it may be suggested that in the groups con- 
taining silent members, the other Ss were 
seen by contrast as much more predictable 
than they otherwise would have been. 
Table 2 contains the findings on group 
productivity. Plotting the group task scores 
for the second work period (as anticipated, 
there were no significant differences in the 
first work period) revealed that normal 
distributions could not be assumed. Con- 


sequently, nonparametric rank tests (for 
independent measures) were used. The ob- 
d B is 


tained z between Conditions A an 
2.18*; between A and C, 2.35%; and between 
A and D, 2.63**. 

Table 2 shows that the si 
reduced group productivity significantly in 
Condition A. These results support the hy- 
pothesis that ambiguous role expectations 
reduce group productivity. At first glance, 
Some more parsimonious explanations might 


lent members 
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5 TABLE 2 
NUMBER OF PROBLEMS SOLVED, UNDE: 
R EA 
Four CONDITIONS, IN First AND Seca oR 
Work PERIODS 


Data Under each Condition 


A B c D 
Ge (Gia 
i lent 
Gi | ee: [oP 
ers, 


Statistic on Number of 
Problems Solved 


5.50 | 6.50 | 6.00 | 6.00 
3.00 | 2.00 | 2.50 


Median: First work pe- 
riod (15 minutes) 
Median: Second work | 0. 

work period (10 min- 
utes) 
Number of groups 


Note.—H for these data (second work period) is 9.29, p < .05. 
TABLE 3 


AVERAGE DEFENSIVENESS SCALE Scores UNDER EACH 
or Four CONDITIONS 


10 10 10 10 


Data Under each Condition 


Statistic on ee 
DSa | ambii Clarity ? D 
a viguity y Control Control 
aet aen] = ®) Wes 
Mean 66.00 61.53 49.17 52.74 
SD 13.92 12.41 14.14 14.10 
N 30 30 30 50 


Note.—High scores indicate defensiveness. 

‘The analysis of variance is not summarized, t 
as only predicted differences have been tested. However, 
all F on the means is 9.72, $ < 01. 


o conserve space, 
the over- 


sted, Thus, one might suppose that 
n productivity in Condition A 
was produced by silent members per se, 
rather than by ambiguous role expectations. 
However, silent members were also present in 
the Condition B groups, where they were as 
silent, “uncooperative,” “ynproductive,” etc., 
as they were in the Condition A groups. 
The difference was that in the Condition B 
groups, the Ss expected the silent members to 
be silent. Therefore, the variable to which 
the decrement in productivity must be 
attributed was the unpredictability of the 
silent members. 

The data in Table 3 support the first 
hypothesis, that the presence of silent members 
produces increased defensiveness. The ¢ ratios 
between Conditions A and C, and A and D, 
are 4.48** and 4.04**, respectively- The 
second hypothesis, that the presence of silent 
members does not produce increased defen- 
siveness if the roles of the group members are 
explicit, is not supported. The differences 


be sugge 
the decrement i 
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TABLE 4 


AVERAGE GROUP SATISFACTION SCALE SCORES UNDER 
Eacu or Four CONDITIONS 


Data Under each Condition 


Statistic on 


D 
GS-Scale A B c 
Scores | Ambiguity | Clarity | Control | Control 
i Sil str itr 
Members) | Maay] w= | WES 
62.60 74.77 74.50 
SD" Sita | 13:82 14.81 | 13.06 
N 30 30 30° | 5 


Note.—High scores indicate satisfaction, w , 
ae ratio between the variances of Conditions A and Cis 
2.31; the over-all F on the means is 18.34, 2 < .01 in both instances, 


between Condition B and Conditions C and 
D on defensiveness are significant, with s 
of 3.45** and 2.79**, respectively. The 
difference between Conditions A and B is 
not significant. 

The data in Table 4 clearly support the 
first hypothesis, that the Presence of silent 
members reduces group satisfaction. The / 
between Conditions A and D is 9,26**, 
that between Conditions A and C is 6.05**, 
As the F ratio on the variances of Conditions 


; partially supported 
by the data. Group satisfaction is higher in 


2; and Conditions C and D, on 
group satisfaction, are 3,93+« and 3.81** 
respectively, i 


At this point, it should be n 
though the primary independ 


S some doubt on 


the interpretation of the results given here. 


One must decide whether Conditions 4 and B 
were really different, as indicated by the 
significant differences between them on pro- 
ductivity and the Group Satisfaction scale; 
or whether they are really similar, as indicated 


by the lack of significant differences on the 
ambiguity item and Defensiveness scale. 

The argument here is that Conditions A 
and B were different, and that role ambiguity 
and role clarification were the variables which 
lowered and raised group productivity. The 
lack of significant differences between Condi- 
tions A and B on the role ambiguity item and 
the Defensiveness scale are interpreted as a 
product of their relatively low reliability 
and of the decay of the role clarification set- 
variable by the time the questionnaires were 
given. However, the data are open to alternate 
interpretation. 


SUMMARY 


Ina laboratory investigation, paid partici- 
pants, who were instructed to remain silent, 
were included in small groups to induce am- 
biguous role expectations. Five-member prob- 
lem-solving groups, each including three naive 
Ss and two silent members, were used in 
each experimental condition. Three- and 
five-member groups were used in control 
conditions, It was hypothesized that am- 
biguous role expectations would reduce group 
productivity and Satisfaction, and increase 
defensiveness, 


All these predictions were well supported 
by the data, : 

Silent paid Participants were included in 
another condition, where group members 
revealed their intended role behaviors. Group 
members thus had accurate expectations 
that the paid Participants would perma 
silent. It was hypothesized that, as a Sp 
of this role clarification, the effects predicte 
above would be reduced or negated. sane 
the presence of silent participants would no 
reduce group productivity and satisfaction, 
or increase defensiveness. i 

Role clarification, with silent paid pron 
pants present in the groups, did, as predicto 
result in normal (i.e., control group lev T 
group productivity. Group satisfaction = 
significantly increased, but not to the noia 
level. The prediction that role clarificat! s5 
would restore the decrement in defensivenê 
was not supported. 
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THE RELATIONSHIP BETWEEN OVERT AND F ANTASY 
AGGRESSION AS A FUNCTION OF MATERNAL 
RESPONSE TO AGGRESSION 
GERALD S. LESSER 
Hunter College 


has developed around the problem of 
establishing relationships between fantasy 
behavior and overt behavior. Different re- 
searchers have used different drive areas, 
different populations, different theoretical 
bases, and different methods of measurement, 
The most conspicuous conclusion is that the 
empirical findings are not in agreement. 
The importance of this area of investigation 
for both clinical practice and personality 
theory has been elaborated by Lindzey (12), 
He concludes that one of the most important 
and difficult problems is the “determination 
of the conditions under which inferences 
based upon projective material directly relate 
to overt behavior and the conditions for 
the reverse” (12, p.18). The present study con- 
cerns the differential conditions under which 
aggressive behavior is learned that may 
allow prediction of how aggressive expressions 
in fantasy are related to those in overt be- 
havior. 
Various studies (9, 13, 16, 18, 19) have 
demonstrated that 
ence between fant 
sociated overt beh: 
drives than for others. Si 


T RECENT years, a voluminous literature 


for sex, and inconclusive Te! Vi n 
Š sults ha: e bee; 
obtained fora wide variety of ot} er varia 
y o 
à lables, 


in their covert manifestations” (13, p: 16) 
while motives that are culturally i 


are apt to show little or no relationship between 
the strength of fantasy and overt expressions. 

Mussen and Naylor (14) have attempted 
to test the first segment of this formulation. 
They contended that lower-class culture 
encourages aggression, and predicted that 
“...in a lower-class group, individuals who 
give evidence of a great deal of fantasy ag- 
gression will also manifest more overt ag- 
gression than those who show little aggression 
in their fantasies” (p. 235). A mixed group of 
white and Negro boys, “...almost all of 
whom had been referred to the Bureau of 
Juvenile Research for behaviors which brought 
them into conflict with school and court 
authorities...” (p, 236), were used as 
subjects. The authors report a statistically 
Significant but not especially strong positive 
relationship between ratings of overt aggres- 
sion and number of aggressive TAT themes. 
Further investigation of Mussen and Naylor $ 
hypothesis would Profit from more precise 
measurement of parental response to aggres- 
Sion, control comparisons, and a more Tep- 
resentative sample. 

The present study seeks to examine the 


comparative consequences of both encourage 


ment and discouragement of aggression 
through the h 


: eae 
Yypothesis that under condition 


of maternal encouragement of aggression # 
greater de, 


gree of correspondence exists al 
tween fantasy and overt aggression of childre? 
than under conditions of maternal discourag® 
ment of aggression, 


METHOD 
Subjects 


0 
The subjects (Ss) were 44 white boys (ages a 
to 13-2) and their mothers. The boys were drawn 6$ 3 
one fifth grade and two sixth grades in two pU ese 
schools. AJ] of the boys and their mothers in ae 
three classes Participated except one mother Toa 
Tefused to be interviewed. The Kuhlmann-Anderst g 
intelligence quotients of the boys ranged from oi 
119, with a mean of 102. The two schools are in a aly 
cent districts and the families constitute a relativ 
lomogeneous upper lower-class group. 
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OVERT AND FANTASY AGGRESSION 


Maternal Attitudes and Practices 


tact AT spect of the environmental conditions of 
aae peuresive behavior was measured, i.e., 
malta: na attitudes and practices supporting or 
satire 5 to aa A structured _questionnaire- 
otaa ta ae a was orally administered to the 
finns tecialin F homes by a male interviewer, Ques- 
constitites ne e support or prohibition of aggression 
the entire ae ne srpnem of the total interview; 
eloewhers C15 eed schedule is described in detail 
Seht iekea otc ea to the present study were 
aggression in children ae — R 
a e gancen 4 a anc thirteen items about the 
havior of h chil r palng iih the aggressive be- 
Bintan ra child. An illustrative item measuring 
oul attitudes toward aggression is: A child 
ta lila 6 E aught to stand up and fight for his rights 
alternan T with other children.” The four response 
disagres ihe of agree, mildly agree, mildly disagree, and 
iter eee allowed for this item. An example of an 
sion is: orien) maternal practices concerning Aggres- 
Bicked o ee our son comes to tell you that he is being 
ang n by a bully at the playground who is his own 
it and size, there would be a number of different 

ungs you might tell him. Would you tell him to ignore 
him and turn the other check?” Response alternatives 
for this item were yes and no. Items that did not involve 
judgments on a four-point scale were transformed to 
have approximately the same range of scores as the 
items that involved four alternatives. 

A single score was obtained for each mother by 
combining all items, assigning plus scores to the re- 
Sponses indicating support of aggression and minus 
Scores to responses indicating discouragement of aggres- 
sion. The range of scores was from +9 to —7, with a 
median score of +2. The corrected odd-even reliability 
Coefficient was .80. 

The distribution of scores for maternal response to 
aggression was dichotomized to form one group of 
mothers (with scores above or at the median) whose 
attitudes and practices were more supportive of aggres- 
sive behavior than those of the other group (with 
scores below the median). The hypothesis demands 
that the correlation between fantasy and overt aggres- 
sion for the children of the mothers in the former group 
be significantly more positive than the corresponding 
correlation for the children of the mothers in the latter 


group. 


Fantasy Aggression 

Fantasy aggression in the children was measured 
through an adaptation of the TAT procedure (13, p. 
3-5). A set of ten pictures was designed. In each pic- 
ture two boys are interacting. The pictures differed 
from one another in the degree to which the instigation 
to aggression was apparent. nas 

To insure complete and accurate transcription of 
the stories, tape recordings were taken. An introductory 
period preceding the fantasy task served both to es- 
tablish rapport between the child and the male exam- 
iner, and to familiarize the child with the recording 
device. Instructions were: 

I’m going to show you some pictures. These are 

Pictures of two boys doing different things. What 
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Pd like you to do is make up a story to each of these 
pictures. You can make up any story you wish; 
there are no right or wrong stories. Say what the 
boys are thinking and feeling and how the story will 
turn out. 

The ten pictures, in the order of presentation, were: 

1. One boy is holding a basketball and the other boy 
is approaching him with arms outstretched. 

2. One boy is stamping upon an ambiguous object 
and the other boy is reaching for the object. 

3. One boy is sitting behind the other boy in a 
classroom and is leaning toward him. 

4. One boy is walking down the street and the other 
boy, with fists clenched, is glaring at him. 

5. One boy, with fists clenched, is staring at the 
other boy who is sitting, head bowed, on a box. 

6. One boy is sawing a piece of wood and the other 
boy is leaning on a fence between them, talking to him. 

7. The two boys, surrounded by a group of other 
boys, are approaching each other with arms upraised 
and fists clenched. 

8. The two boys are making a fire. One boy is 
kneeling to arrange the wood and the other boy is 
approaching, ladened with wood for the fire. 

9. One boy, who is looking back, is running down a 
street and the other boy is running behind him. 

10. Two boys are standing in a field. One boy, with 
his hand on the other boy’s shoulder, is pointing off 
in the distance. 

A fantasy aggression score was obtained for each 
S by counting the number of times the following acts 
appeared in his stories: fighting, injuring, killing, at- 
tacking, assaulting, torturing, bullying, getting angry, 
hating, breaking, smashing, burning, destroying, 
scorning, expressing contempt, expressing disdain, 
cursing, swearing, threatening, insulting, belittling, 
repudiating, ridiculing. 

Fantasy aggression scores ranged from 1 to 15, with 
a mean of 5.3. The corrected matched-half reliability 
coefficient was .86; the inter-judge scoring reliability 
coefficient was .92. 


Overt Aggression 

To measure overt aggression in the child, a modified 
sociometric device, the “Guess who” technique , 
was adopted. The Ss were presented with a booklet 
containing a series of written descriptions of children, 
and asked to identify each of these descriptive char- 
acterizations by naming one or more classmates. 
Fifteen overt aggression items were used, such as 
“Here is someone who is always looking for a fight.” 
A diversity of aggressive behaviors were included; 
items depicted verbal, unprovoked physical, provoked 
physical, outburst, and indirect forms of aggressive 
behavior. 

An overt aggression score was obtained for each 
subject by counting the number of times he was named 
by his classmates. There were substantial differences 
among the three classes in the distributions of the 
overt aggression scores; in order to combine into one 
distribution the scores of children in different classes, 
overt aggression raw scores were transformed into 
standard scores. 

The biserial correlation cocfiicient b-tween the 
overt aggression measure derived from the children 
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and teacher entries for the same “Guess who” aggres- 
sion items was .76 (p < .01). 


RESULTS 


Two Pearson product-moment correlation 
coefficients were obtained. For boys (V = 23) 
whose mothers are relatively encouraging 
or supportive of aggression, the correlation 
between fantasy aggression and overt aggres- 
sion is +.43 (p < .05, two-tailed test). For 
boys (V = 21) whose mothers are relatively 
discouraging of aggression, the corresponding 
correlation is —.41 (p < .10, two-tailed test). 
These coefficients are statistically different 
(p = .006, two-tailed test). 

When the total sample is not separated into 
two groups on the basis of scores for maternal 
Tesponse to aggression, the overall Pearson 
product-moment correlation coefficient is 
+.07. This coefficient is not significantly 
different from zero. 


Discuss1on 


Confirmation is found for the hypothesis 
that under conditions of relative maternal 
encouragement of aggression, a greater degree 
of correspondence exists between the fant 
and overt aggression of children than under 
conditions of relative maternal discouragement 
of aggression, Thus, the direction and extent 
of the relationship between fantasy and overt 
aggression in the child is apparently influenced 
by the maternal attitudes and practices 


arning of aggressive be- 


asy 


It has been predicted (13, 16) that those 
tendencies which are negatively sanctioned or 
prohibited will be high in fantasy expression 
and low in overt expression. This association 


Sroup were classifie h 
as discouraging ag; E 


certain 
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of them do so ineffectively, and thus allow the 
child sufficient release of aggressive feelings 
in overt behavior so that he may not need to 
express aggression in fantasy. An alternative 
speculation Tegarding the concurrence of 
low fantasy aggression and high overt aggres- 
sion in the group exposed to maternal dis- 
couragement of aggression Suggests that a 
child with strong aggressive needs whose 
mother prohibits aggression may assign this 
prohibitory attitude to the adult experimenter 
and suppress fantasy aggression expressions 
in the testing situation; yet this child may 
find avenues for overt expression of aggression 
among his peers, 

In the present study, only one condition 
related to the learning of aggressive responses 
and controls was assessed, maternal attitudes 
and practices. Other possibly critical deter- 
minants that remain to be explored include 
fathers’ behavior and teachers’ attitudes and 
practices. This study has sampled a limited 
range of maternal attitudes and practices 
concerning aggression, Although there is no 
direct manner of determining the absolute 
degree of punitiveness of the most prohibitive 
mother in this sample, it appears unlikely 
that extremely severe and continuous maternal 
Punitiveness is represented. Such severe 
condemnation of aggression might so limit or 
restrict both the fantasy aggression and 
Overt aggression expressions of the child that 
no correlational analysis within such a group 
would be possible. Both the extremes of 
unimpeded permissiveness and severe Con- 
demnation warrant further investigation. 


SUMMARY 


The relationship between fantasy and overt 
expressions of aggression was studied as 2 
function of the maternal attitudes and prac- 
tices toward aggression. Subjects were 
boys and their mothers. The boys’ fantasy 
aggression was assessed through a modified 
TAT approach, their overt aggression was 
measured through a modified sociometric 
technique, and maternal attitudes and prac- 
tices toward aggression were measured by 
use of a questionnaire-interview device. 
Support was found for the hypothesis that 
under conditions of maternal encouragement 
of aggression, a greater degree of correspond- 
ence exists between fantasy and overt aggres- 


oe 


OvERT AND FANTASY AGGRESSION 221 


sion of children than under conditions of 
maternal discouragement of aggression. 
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AFFILIATION MOTIVATION AND PRODUCTIVITY 


IN SMALL 


GROUPS 


RICHARD pECHARMS! 


University of North Carolina 


HE general hypothesis underlying the 
research reported here is that there 
is a systematic relationship between 
the strength of affiliation motivation and 
cooperative group behavior. At first glance one 
might expect that affiliation motivation would 
facilitate cooperative group behavior and 
inhibit competitive behavior, It seems plausible 
that a person high in the affiliation motive 
should perceive working for a group goal 
as an activity instrumental to acceptance by 
the group, and therefore be predisposed to 
Cooperate rather than compete. 
These simple 
by at least two 


may have different behavioral concomitants. 
or instance, unpopular Ss tend to show 
more threat-oriented responses on a projective 
measure of motivation. 
Threat-oriented and goal-oriented motiva- 
tion probably result from differential ex- 
Periences of acceptance in the past. The 
goal-oriented person redintegrates Positive 
Past experiences of acceptance in an affiliation 
situation, while the threat-oriented p 
redintegrates negative experiences of 
tion. The person who has been accept 
the past may have learned that coope 


erson 
rejec- 
ed in 
ration 


? This report derives from a larger study submitted 
to the University of North i 


tation. The author is indeb 


cism. 
The study was conducted in the Program of the Or- 

ganization Research Group of the Institute for Research 
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leads to further acceptance. The previously 
rejected person, on the other hand, may desire 
acceptance so strongly that anticipation of 
rejection is particularly frustrating and leads 
to an aggressive competitive response. Affilia- 
tion motivation of the threat-oriented type 
may therefore facilitate competition rather 
than cooperation. Competition may in turn 
have been rewarded by recognition, which 
may even have led to greater acceptance of 
the originally rejected person. P 

These considerations lead to the following 
hypotheses: 

Hypothesis T. Under conditions that lead 
them to anticipate rejection, Ss high in threat- 
oriented affiliation motivation are more pro- 
ductive on a competitive task than Ss low in 
threat-oriented affiliation motivation. d 

Hypothesis IT. Under conditions that lea 
them to anticipate rejection, Ss high in threat- 
oriented affiliation motivation are less pro- 
ductive on a cooperative task than Ss low in 
threat-oriented affiliation motivation. 2 

Under conditions that lead them to eel 
pate acceptance, Ss high in goal-orient ie 
affiliation motivation should be more pronke 
tive on a cooperative task and less product! : 
on a competitive task than Ss low in go% r 
oriented affiliation motivation. The ae 
experiment cannot adequately test ae 
Predictions concerning goal-oriented yee ven 
motivation, since the design did not oan 
situations in which anticipation of acceptan 
Was made salient. However, the effects s 
goal-oriented affiliation motivation may apps 
in a weaker form in the control groups wbic 


: Be c- 
experienced no arousal of anticipated & 
ceptance or rejection 


METHOD 
Subjects 


A total of 88 male undergraduates participated 
the experiment, They were selected from serial a 
tions of an introductory psychology course accor a 
to their affiliation and achievement motivation score’ 
(n Affiliation, n Achievement) and met in groups 3 
four, five, or six for the experiment. Individual gron] A 
were composed of Ss both high and low in n Affiliatio' 


A 


R, 
Hi 
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The selection procedure assured comparable n Achieve- 
ment scores for Ss in the various cells of the design. 


Design 


7 In each experimental condition, the actual tasks to 

€ performed by Ss were identical. Only the task in- 
iE ueeons and the experimental procedures prior to the 
oe sae manipulated. Half of the groups received 
ao ve task instructions; the other half, coopera- 
Ea asl instructions, The experimenter (E) attempted 
na the affiliation motive and anticipation of re- 
k n in half of the groups; for control, this procedure 

as omitted with the other half. Thus, the experimental 
ee called for a two-by-two design, varying task 
a Tons and arousal conditions. Within each of the 
S cells, Ss could be divided into those above and 

ose below the median of the measures of motivation. 


Measurement of n Affiliation 


de E their first laboratory mecting the Ss were 
to write stories in response to picture stimuli 
a oning to the procedure described by McClelland 

al. (5). A six-picture measure of n Affiliation was ob- 
tained by scoring according to the procedure described 
by Atkinson et al. (1). Five of the pictures have been 
used by McClelland and his associates. The sixth pic- 
ture, representing four men presumably discussing 
Something or chatting casually, was taken from Henry 
and Guetzkow (3). 

Each story containing affiliation imagery was classi- 
fied as either predominantly positive (goal-oriented) or 
Predominantly negative (threat-oriented). A story was 
classified as positive when it contained predominantly 
approach responses, such as statements about positive 
affective relationships between characters, or characters 
Striving for and attaining affiliation goals (positive 
Outcome). Components considered positive were com- 
Panionate activities, such as parties, reunions, visits, 
Small talk, bull sessions, etc. A story was classified as 
negative when it contained predominantly avoidance 
responses, such as statements about concern over 
Separation, or characters striving for an affiliation goal 
Not specifically attained in the story (negative or 
questionable outcome). Negative components included 
Concern over: being left out or rejected; being lonely 
Oor without friends; experiencing physical separation 
Including death; experiencing psychic separation, such 
as disagreements or quarrels (including attempts to 
make amends for past actions that brought about 
Separation or rejection); and experiencing no reciproca- 
tion of love, 

The +Affiliation score of any S was the total of the 
n Affiliation scores for all stories which were judged to 
be predominantly positive. The —Affiliation score was 
derived in a corresponding manner. The sum of the 
Positive and negative affiliation scores for an S (with- 
Out regard to sign) thus equals the total n Affiliation 
Score as described by Atkinson ef al. (1). 

The pictures were found to elicit approximately 
equal numbers of positive and negative stories and thus 
appear to yield a balanced measure of total n Affiliation. 
There was essentially no correlation between the posi- 
tive and the negative affiliation motivation scores 
@ = —.15, N = 88). 
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Interscorer reliability of two experienced scorers? 
was measured throughout the scoring by spot checking 
approximately one-third of all stories scored. The 
agreement between scorers was 95 per cent for affiliation 
imagery alone, and 92 per cent for all categories. 


Affiliation Arousal 


Each S who underwent this procedure was asked to 
rank a series of traits individually from most to least 
complimentary. An eight-minute group discussion 
followed in which the group was to try to agree on a 
single group ranking of the traits. Each S was told that 
he was also to form an opinion of the other members, 
since he would be asked to evaluate them and to choose 
work partners for a subsequent task. After the dis- 
cussion, each S stood before the group and read his 
original ranking of the traits, while all others chose 
two characteristic traits for him. Finally, each S indi- 
cated his preference for work partners by ranking the 
others. This entire procedure was omitted in the control 
groups. 

The arousal procedure followed closely that described 
by Atkinson et al. (1) with the addition of the group 
discussion. This was included in order to give the Ss 
some basis for rating each other and so that some of 
the characteristics of a group could begin to emerge. 
The ratings and partner choices were intended to make 
S acutely aware of being evaluated by other members 
of the group. Both Shipley and Veroff (6) and Atkinson 
et al, (1) point out that this brings out fear of rejection. 
Therefore, Ss in the affiliation arousal condition may 
be considered to anticipate rejection much more 
strongly than those in the control condition. 


Task Instructions 

The competitive instructions presented an individual 
goal of competition for highest productivity, the results 
to be a ranking of individuals in the group. The co- 
operative instructions presented a team goal of co- 
operation among members on individual problems to 
be summated for the group score. 

To make the cooperative task instructions more 
realistic, the group was instructed to divide the total 
task among themselves in a specified way so as to finish 
as many of the problems as possible, each person start- 
ing on a different page in the booklet, working the 
remaining pages in order, and returning to the first 
page if he reached the end. It was stressed that he 
should do as many problems as possible since this would 
increase the group score. The booklets were actually 
composed so that although each S started on a different 
page, they were all actually working on the same 
problems in the same order. The task was therefore 
comparable under all conditions. 


Group Attractiveness 
nnaire, each S' was 


On the post-experimental questio y 
asked to rate on a seven-point scale his agreement with 
his liking for the group during the 
ms were as follows: (a) If I had to 
ork with me on a real problem, I 
f the members of this group. (b) I 


items concerning 
experiment. The ite! 
choose a group to w 
would choose some o! 


2 Mrs. John S. Crittenden and the author. 
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TABLE 1 


MEAN NUMBER OF Tasks ATTEMPTED AND THREAT- 
ORIENTED AFFILIATION MOTIVATION 
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TABLE 2 
MEAN NUMBER or Tasks ATTEMPTED AND GOAL- 
ORIENTED AFFILIATION MOTIVATION 


Mean Tasks Attempted 


Affiliation No 
Arousal (Threat 


Task and Level of Threat- 
Oriented Affiliation 


Mean Tasks Attempted 
Affiliation 


Task and Level of Goal- 
Oriented Affiliation 


o 
cers Motivation Arousal (Threat ] 
Motivation of Rejection) Arousal of Rejection) Arousa! 
eos Competitive Task 
ENA 133,8 126.3 High +Afliation 118.4 na 
10 n, (9) 
n) ; ( ati 120.5 
— Affiliation 112.5 115.0 Low -+-Affiliation 124.8 
a a2) (14) (n) (13) e 
Men 122.2 119.1 
—— — Cooperative Task 5 
(n) (22) (22) Hi +Afüliation 101.8 tes 
a (12) 
Cooperative Task Low +Afiiliation 109.7 T.A 
High — Affiliation 100.6 87.8 (n) (10) @) 
P asia Koz ra 
— Affiliation à zi a Ja M a 
u aii (1) (10) was significantly higher than mean pro 
Nean 105.4 86.3 ductivity in the cooperative task sieno 
(n) (22) (22) (p <. 01). There was no significant difference 


felt some dislike for at least one of the members during 
the tasks. (Reverse scored.) (c) In general, I liked most 
of the people in my group very well, 


Productivity 


The productivity score for eac! 
sum of the problems attempted o 
was a simple arithmetic task. Tasi 
words task. Performance on bo 
previously been found to be r 
motivation (4, 7). 


h S consisted of the 
n both tasks. Task I 
k II was a scrambled 
th of these tasks has 
elated to achievement 


Procedure 


The affiliation arousal wa: 
aroused groups and was omi 
The task instructions placin; 
cooperative task situation wi 
Ss, seated in isolation booths worked on 120 arithmeti. 
problems for 12 minutes, folly E 
on 120 scrambled words. Feedback was suppli 
privately, informing him that hi Pp fed each 5 


was just below the median rank of his group, This was 
ministration of the post- 


al Manipulations, 
trates that mean 


between mean productivity under affiliation 
arousal and under no arousal. Howevels 
considering only the cooperative task grouP® 
mean productivity of Ss under affiliation 
arousal was significantly higher than Pt 
ductivity under no arousal (p < .01). _. u 
A median split of ++Affiliation or —Afiilia 
tion scores within each of the four expel 
mental conditions resulted in unequal 3 
entries. A multiple regression analysis T 
therefore computed (see Table 3) to determi? 
the effects due to level of motivation mii 
each experimental condition, For aga 
purposes the means in Tables 1 and Z - 
high and low levels of motivation will il 
discussed. However, significance levels W! 
refer to tests of the variance due to regress oe 
Hypothesis I predicts higher productivity 
for Ss high in — Affiliation than for 5s 10W at 
—Affiliation on a competitive task a 
affiliation arousal (threat of rejection). Ta 
1 shows a large difference in the predictas 
direction, and regression analysis reven ) 
that this is a reliable difference (p < 
Confirming the hypothesis, Under no age 
there was a similar but insignificant differen i 
The data in Table 1 are also consistent est 
Hypothesis IL which predicts lower va 
ductivity for Ss high in —Affiliation than 
Ss low in —Affliation on a cooperative task 
under affiliation arousal. The regression an 
ysis, however, reveals that this differen’ 3 
falls short of customary standards of sig? 
icance (p < .07). 
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ND 
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i TABLE 3 
ANALYSIS OF VARIANCE DUE TO REGRESSION OF PRODUCTIVITY ON GOAL-ORIENTED 
AND THREAT-ORIENTED AFFILIATION MOTIVATION SCORES 


Arousal Task Motive df Mean Square F b 
Affiliation Arousal Competitive +Affiliation 1 814 1.15 —2.1754 
i — Affiliation $ 5,173 T.2074" +6.0783 
Cooperative +Afñliation l 547 A —1.7836 
— Affiliation 1 2,546 3.59** —4.2643 
a 
No Arousal Competitive +Affiliation 1 110 16 —0.8012 
X —Afħliation l 490 -69 +1.8714 
Cooperative +Affiliation 1 2,027 2.86* +3.4327 
— Affiliation 1 101 14 +0.8500 
Error} 76 709 

* p <10. 

> p <.07. 

s$ <01 


t Twelve degrees of freedom were used in testing variance duc to differences between cell means and variance due to regression. 


For a test of the significance of the inter- 
action of level of threat-oriented n Affiliation 
with kind of task under affiliation arousal, the 
difference between the two computed re- 
gression coefficients was tested. The significant 
relationship is found that would be predicted 
by combining Hypotheses I and II (p < .01). 

It has been noted that the experimental 
design did not provide a situation of heightened 
anticipation of acceptance for testing predic- 
tions for +-Affiliation, but data for the No 
Arousal condition provide suggestive evidence. 
Table 2 presents means for Ss high and low 
in +-Affiliation. As predicted, productivity on 
a cooperative task under No Arousal was 
higher for Ss high in +Affiliation than for 
Ss low in --Affiliation, although the trend 
falls short of statistical significance (p < -10). 

_ Analysis of productivity and total n Affilia- 
tion scores reveals smaller relationships similar 
to those with —Affiliation in the arousal 
cells, but -- Affiliation and —Affiliation scores 
Considered separately seem to be better pre- 
dictors of performance than total n Affiliation 
under the present conditions. 


DISCUSSION 
hich are in general 
ble 1, were based 
e pro- 
is the 
in 


The main hypotheses, w 
supported by the data in Ta 
on the assumption that competitivi 
ductivity under certain conditions 
result of hostility induced in Ss high 
—Afüliation by threat of rejection. Using the 
Group Attractiveness Scale as a rough measure 
of hostility toward the group, it was possible 
to examine the relationships between hostility, 


—Affiliation, and the experimental manipula- 
tions. Although the relationships were all 
small and insignificant, they were remarkably 
consistent. Hostility was greater in Arousal 
than in No Arousal cells, and greater for Ss 
high in — Affiliation than for Ss low in —Affilia- 
tion. Competitive productivity followed the 
same pattern. It was higher in the Arousal 
cells, and (as shown in the test of Hypothesis 
I) high for Ss high in —Affiliation. Further, 
competitive productivity was higher for Ss 
high in hostility than for Ss low in hostility; 
and, conversely, cooperative productivity was 
lower for Ss high in hostility than for Ss low in 
hostility. 

Table 1 also shows that cooperative pro- 
ductivity was significantly greater under 
Arousal than under No Arousal. This finding 
cannot be attributed to threat of rejection, 
which should inhibit cooperative productivity, 
especially in Ss most influenced by threat. 
It may well be attributable to the group dis- 
cussion, in which only the aroused group 
participated and which allowed Ss to become 
acquainted. They seemed to gain a more 
positive attitude toward the situation, as 
indicated by post-experimental discussions 
with the groups, and this positive attitude 
could well have produced an increased incen- 
tive for cooperative work. 

Further examination of the data shows that 
cooperative productivity under arousal was 
relatively greater for Ss low on any of the 
measures of affiliation motivation. The most 
striking difference was with total n Affiliation. 
Cooperative productivity for Ss low in n 
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Affiliation was higher in the aroused than in 
the control condition, (aroused mean = 
116.8, control mean = 79.5), while pro- 
ductivity for Ss with high n Affiliation was 
approximately the same under the two condi- 
tions (aroused mean = 89.0, control mean 
= 90.5). The overall increase under arousal, 
which is attributable entirely to the Ss with 
low motivation, may be similar to Wendt’s 
findings (7) with achievement motivation 
under scheduled and nonscheduled task 
conditions. He interprets his results as showing 
the effect of imposing extrinsic motivation 
upon Ss low in intrinsic motivation in the 
scheduled situation, thereby bringing their 
productivity up to those who were intrinsically 
motivated and had high productivity under 
either scheduled or nonscheduled conditions, 
In the present experiment, however, the 
productivity under arousal of Ss low in n 
Affiliation was significantly higher than that 
of the Ss high in n Affiliation. It would seem 
that the extrinsic threat of rejection has 
interacted with the intrinsic n Affiliation 
thereby inhibiting increased cooperative pro- 
ductivity under the arousal condition for Ss 
high in n Affiliation, while the group discussion 
facilitated cooperative productivity for Ss 
low in n Affiliation to whom the threat was 
not as important. 


SUMMARY 


This study tested the hypotheses that under 
conditions favoring the anticipation of rejec- 
tion, Ss high in threat-oriented affiliation 
motivation are (a) more productive on a 
competitive task, and (b) less productive on a 
Cooperative task than Ss low in threat. 
oriented affiliation motivation. These hypothe- 


ses were in general supported b: d; 
undergraduate Ss. prem ae 


In the control condition 
neutrally with respect to ar 
acceptance or rejection, 
oriented affiliation motivati 
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Ss low in this respect) tended to be more 
productive on a cooperative task. This 
relationship did not reach statistical signi- 
ficance. 

Under Affiliation Arousal (which included 
both a group discussion and threat of re- 
jection) there was higher cooperative pro- 
ductivity than under No Arousal, a difference 
contributed almost entirely by Ss low in 
total affiliation motivation (goal-oriented plus 
threat-oriented). This result was interpreted 
as showing an interaction between the effects 
of the group discussion and threat of rejection. 
The group discussion facilitated cooperative 
productivity in general, but among Ss high 
in affiliation motivation this effect was 
counteracted by the threat of rejection. 

The other relationships between overall 
affiliation motivation and productivity were 
insignificant. The importance of distinguish- 
ing between goal- and threat-oriented affilia- 
tion motivation was stressed, 
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COGNITIVE SET AND MOTIVATIONAL FACTORS IN THE 
PERCEPTION OF NEUTRAL AND THREAT- 
RELATED STIMULI 
RUTH C. WYLIE 


Sarah Lawrence College 


ECENTLY, experimental psychologists 
have been interested in the proposi- 
L tion that perceptual thresholds are 
higher for so-called emotionally threatening 
stimuli, and lower for positively valued stimuli 
than for affectively neutral ones. Tests of such 
a proposition have yielded ambiguous results 
because of difficulties in controlling or knowing 
the status of such factors as: (a) subjects’ at- 
titudes toward the stimuli, (b) frequency of 
subjects’ past experiences with the stimuli 
(c) subjects’ cognitive sets or expectancies; 
and (d) the availability of the required re- 
sponses as they are determined by subjects’ 
previous practice in making the responses 
and willingness to report honestly (1-3; 5-11). 
This experiment employed new means to 
control the above-mentioned varjables in a 
comparison of the influence of motivational 
factors plus cognitive set with the influence of 
cognitive set alone. In this design the experi- 
menter (E) induced the desired emotional state 
and cognitive expectancies in the subjects 
under laboratory contro: 
tested numbers as stimuli, 
specially devised conditions, 
proved control of factors b, c, and d. 
The following hypotheses were tested: 
Hypothesis 1. When subjects (Ss) have just 
been given an auditory presentation of a stimu- 
lus and have reason to expect that they may 
e same stimulus visually, their dura- 
for such visual stimuli are 
rwise comparable visual 
t has been 


1; and the use of pre- 
presented under 
provided im- 


be given th 
tion thresholds 
lower than for othe 
stimuli for which no such cognitive se 
established. 
Hypothesis 2. W! 
a task, their duration 
stimuli indicating that they have failed are 
greater than their duration thresholds for 
otherwise comparable visual stimuli indicating 


hen Ss desire to succeed in 
thresholds for visual 


oscope used in this study was 
the American Philo- 
so to Dr. Georgiana 
ing task at another 


1 The Gerbrands tachist 
purchased under a grant from 
sophical Society. Thanks are due al: 
Wylie who pretested the ego-involv 


ve succeeded. These differences 
greater than could be 
basis of cognitive set or 


that they ha 
are expected to be 
accounted for on the 
expectancy alone. 
Hypothesis 3. When Ss desire to succeed in 
a task, the content of their prerecognition 
guesses about visual stimuli which indicate 
that they have failed resembles the content 
of stimuli which, had they been presented to S, 
would have indicated success. This distortion 
of prerecognition guesses is expected to be 
greater than could be accounted for on the 
basis of cognitive set or expectancy alone. 


METHOD 


Stimulus Materials 


Two-place numbers were used as stimuli on the 
assumptions that: (a) such stimuli would be emotionally 
neutral to Ss until experimental conditions produced an 
emotional set toward them; (b) the response avail- 
ability for all two-place numbers would be the same, 
insofar as this is determined by frequency of past 
visual exposure, practice in pronunciation, and lack of 
embarrassment in pronunciation. On the basis of a 
preliminary experiment, testing the tachistoscopic 
recognizability of two-place numbers of a standard print 
type and size, these numbers were classified into three 
groups: hard, medium, and easy to recognize, for use in 
matching the “threatening” and “rewarding” numbers 


in the main experiment. 


Subjects 


The Ss w 
naive with respect 
They were divided into three 


ere 51 volunteer female college students, 
to the purpose of the experiment. 
groups of 17 each. 


Procedure 

Group I. Group I Ss were to 
pate in two separate experiments. In the “first experi- 
ment” (Task 1), each S was given individually a spe- 
cially devised and pretested task. She was told that, 
although the parts of the task were simple, this was a 
new intelligence test which revealed the ability of a 


person to keep her head and concentrate in a fast-mov- 


ing, distracting situation which might “rattle” her. 
She was instructed that preliminary information indi- 
cated that this is a very important kind of intelligence 
for everyday living, and she was encouraged to keep 
trying by being told she would receive partial credit 
for nearly correct answers. This task consisted of 14 
chains of arithmetic operations, with ten operations 
per chain. The Æ read each chain aloud at the rate of 
about one operation every five seconds, while S did the 


Jd they were to partici- 


college. 
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operations mentally and wrote only her final answer 
for each chain. The oral presentation and mental com- 
putation prevented S from knowing whether any answer 
was right or wrong unless she had completely failed to 
give an answer or had obtained a number with more 
than two places. The S was asked to guess how many 
problems she had done correctly and how many almost 
correctly; and to tell how sure she was of this guess. (No 
S was sure of her guess.) Each S was asked to rate her- 
self on her ability on this type of task, as compared 
with the average college student. Finally, she was given 
fictitious norms which were higher than the actual 
attainment of the college group. Interviews at the end 
of the entire experimental procedure indicated that Ss 
were ego-involved on Task 1. 

After Task 1, S was told that the “first experiment” 
was finished and she was given a brief rest period before 
the “second experiment.” Meanwhile £ prepared, in 
another room, a series of 14 numbers, to be presented 
in a Gerbrands mirror tachistoscope. Regardless of S’s 
answers on Task 1, half the numbers for this series were 
selected to agree and half to disagree with S’s answers, 
The disagreeing numbers were chosen to be at least as 
readable as the agreeing ones. Agreein; 
items were put in random sequen 
practice and fatigue. 

The S was then brought into the 
and asked if, while she was in the 1 
she would mind helping Æ test a 
effort was made to avoid ego. 
very accurate in the perceptual! 
keep her from suspecting the co 
1 and 2 by telling her that Æ w: 
ing conditions and apparatus f 
ment, and was merely trying 
task less boring to S by usin, 
answers to Task 1. (The purpose of avoiding ego- 
involvement in perceptual accuracy was 
masking possible motivationally- 


g and disagreeing 
ce to control for 


tachistoscope room 
laboratory anyway, 
new apparatus. An 
-involving S$ in being 
l task as such, and to 
mnection between Tasks 
as only testing the light- 
or use in a further experi- 
to make this Perceptual 
g for stimuli the correct 
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discerned the true connection between Tasks 1 and 2. 
The Ss reported that the directions had been con- 
vincingly misleading, but some of them said they had 
wondered whether hearing the number and/or wishing 
to see it might have influenced their perceptions, How- 
ever, they all reported trying hard to base their guesses 
exclusively on the visual appearance of the stimuli. 
Each S was asked to keep the experiment secret so that 
future Ss would be naive, 

It can be seen from the description of the procedure 
used with Group I that both expectancy and “wishful 
thinking” might influence their perceptual thresholds 
for those tachistoscope numbers which agreed with the 
numbers E read out to them. i 

Group II. Each Group II S was arbitrarily paired 
with a Group I S. The treatment given to Group I Ss 
was intended to make their experiences in the tachisto- 
Scope room exactly the same as those of Group I, except 
that their attitudes toward the numbers used as visual 
stimuli should be emotionally neutral, allowing one to 
study the influence of cognitive expectancies in the ab- 
sence of motivational influences. Consequently, these 
Ss heard nothing about Task 1, but went directly to the 
tachistoscope room where they were given the same in- 
structions for Task 2 as had been given to Group I, with 
the following exception: Group II Ss were told that E 
would “read out a number” instead of that Æ woul 
“read out your answer to a problem on the intelligence 
test you just took.” Any given Group II S was given 
the same list of “read out” and “viewed” numbers a5 
had been used for her corresponding Group I S. Each 
Group II $ was told, “I can’t tell you how many of me 
numbers flashed on the screen will be the same as those 
on my list and how many will be different. There might 
be any degree of agreement between zero and 100 a 
cent. The greatest likelihood is that there will be abou 
> agreements out of 14 numbers which you will kava 
flashed on the screen. You can’t be sure of this, nor 
ever.” The number of agreements suggested to a give ie 
Group II S, was the number of answers the correspo” 
ing Group I S stated she thought she had gotten correc 
at the end of her Task 1 test. 

A careful inspection of the procedures given Groupe 
T and TI suggested that any differences in performance 
between these groups were to be interpreted with cau 
tion, because the Parallel instructions to the groups 
did Not suffice completely to equate cognitive expec 
tancies between them, The Ss in Group I, for examples 
ad reason from Task 1 to expect (as well as to hope 
that some of the numbers might be nearly agreeing, 
not exactly agreeing, even though the tachistoscopa 
directions did not mention this possibility. On t i 
other hand, the Group TT Ss had had no comparable 
Previous experience that might have established a care 
nitive expectancy of a certain number of near agen 

ments. In an attempt to equate this factor of cognitiv? 
expectancy more effectively, the procedure used m 
Group II was administered to a third group, mir 
the single modification that the directions were change 
by the addition of the underlined clause below: “- - °° 

Conant likelihood is that there will be about—- 
agreements out of 14 numbers which you will heave 
flashed on the screen, with perhaps — coming far 
close.” The exact number mentioned in the underline 
clause was that which the corresponding Group 
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thought she had gotten nearly correct in Task 1. By 
this change of directions the expectancy of number of 
near as well as exact agreements was made more com- 
parable between each Group III S and her matched 
Group I counterpart. 


RESULTS 


For each S, the median threshold for all 
agreeing numbers was found (called A) and 
the median threshold for disagreeing numbers 
was obtained (called D)? (A — D) was pre- 
dicted to be negative in the cases of all three 
groups. Groups II and III were employed to 
test Hypothesis 1 which was confirmed by the 
finding that 31 out of 34 Ss in these groups 
showed negative values for (A — D), a pro- 
portion significant far beyond the .01 level. 
Since other variables have been controlled, 
this result can reasonably be attributed to the 
effects of hearing the number and expecting the 
possibility of seeing it, i.e., to cognitive sets. 

Although simply hearing the number and 
expecting possible agreement leads to faster 
perception, the second hypothesis asserts that 
hoping to see what one has just heard and 
hoping not to see any other number should 
make the (A — D) values of the Group I Ss 
more negative than those of Group II or Group 
III Ss. In order to make such comparisons, 
a refined measure of the discrepancy between 
thresholds for agreeing and disagreeing items 
was needed that takes into account the 
individual differences in intra-individual varia- 
bility in thresholds, in terms of which the 
groups were not equated. To obtain such a 
measure, an index of intra-individual varia- 
bility (here called 7) was first computed for 
each S by taking the range of her middle 
six thresholds.’ Her (A — D) value could 
then be evaluated in relation to her overall 
intra-individual variability. The score used 


2 An item on which a Group I S knew she had failed 
because she got no answer or got more than a two-place 
number for an answer was eliminated from the calcu- 
lation of D of both the Group I S and the correspond- 
ing Group II S. To keep the counterbalancing of the 
ordinal position of agreeing and disagreeing items in- 
tact, in such a case, the nearest agreeing item was also 
eliminated from the calculations of A for the Group I 
and corresponding Group II S. 

3 This measure of variability was used rather than 
the SD because: (a) the number of thresholds in each 
individual’s distribution was small, (b) the individual 
distributions were often skewed. (I approximates the 
interquartile range without involving interpolations or 
decimal values unwarranted by the size of the dis- 
tributions or precision of the data.) 
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as the dependent variable in testing Hypoth- 
esis 2 may be summarized thus: 


(A =D) 
I 
where 
A = S’s median threshold for agreeing num- 
bers 
D = S’s median threshold for disagreeing 
numbers 


I = range of middle six thresholds for S 


The difference in mean score between Group 
I and Group II was found to be +0.11, which 
yielded an insignificant / value of 0,60!, while 
the difference in mean score between Group 
III and Group I was only —0.01. Thus, the 
data give no support to Hypothesis 2; i.e., 
there is no indication that wishful thinking 
added to expectancy produces any greater 
effect on duration thresholds than does 
expectancy alone. 

The third hypothesis postulated that pre- 
recognition guesses of the Group I Ss might 
reveal “wishful thinking” by tending to be 
closer to the “read-out” number than were 
the prerecognition guesses of Group II or 
Group III Ss. To test this hypothesis, all 
prerecognition guesses within +10 of the 
“read-out” number (S’s answer in the case of 
Group I), were tallied and compared to the 
total number of prerecognition guesses made 
by S, yielding a proportion score for each S. 
The differences between the proportion score 
of each Group I S and that of her correspond- 
ing Group II S were obtained by subtraction 
across pairs. The mean difference was 0.14 
and ¢ = 2.48, which is significant between the 
(02 and .01 level on a one-tailed test. These 
results support the hypothesis that the Group 
I Ss, who are hoping to see a number at least 
near their answer, will tend to see more 
numbers near the “read-out” number than 
will the control Ss, who have no reason to 
care whether the tachistoscope number is near 
the “read-out” number. However, for reasons 
described under Method, a comparison of 
Group III with Group I provides a better 
controlled test of Hypothesis 3. When the 
prerecognition guesses of Group IM were 
compared to those of Group I in the same way 


ariable was found 


4 No correlation on the dependent v: 
so uncorrelated ¢ 


between Group I and Group III Ss, 
formulae were used. 
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that II and I had been compared, the differ- 
ence was +.08, which favored the hypothesis 
at the .10 level (one-tailed test). Groups IT 
and III did not differ significantly between 
themselves as to prerecognition guesses, so 
they were combined and then compared with 
Group I. The difference was +.11, with a P 
slightly >.05 on a one-tailed test. 


Discussion 


The results of this experiment give clear-cut 
evidence for the effects of cognitive expect- 
ancies upon duration thresholds, whether or 
not motivational influences which might 
operate in the same direction were present, 
The data give some support to the notion 
that prerecognition guesses may be modified 
by motivational influences such as desire to 
succeed in an ego-involving task. No support 
is given the proposition that such motivational 
influences affect duration thresholds when 
other variables are held constant (the usual 
definition of perceptual defense), 

The question arises as to whether those 
differences which were found are attributable 
to effects on the perceptual processes or to 
effects on the response Processes, to follow 
the distinction developed most recently by 
Garner, Hake, and Ericksen (4). It seems 
likely that the effects are on the perceptual 
Processes in view of the operations instituted 
to control for the influences on response proc- 
esses. In the first place, the use of numbers 
should make motivational Teasons for withold- 
ing responses less than th 


should tend to reduce i 
other sources. 


The results on co 


gnitive set are in line wi 
n e 
much previous wor = 


3 and they further point 
trolling for this fa 
es where it js not the 


S’s past experience with the stimuli and the 
like. The effects obtained are sometimes 
diminished or even reversed when these 
other variables are altered (6, 10, 11). In one 
well controlled recent experiment, no evidence 
of perceptual defense could be obtained (13). 
It was hoped that the new methods and stim- 
ulus materials introduced in this study would 
also effectively control the variables which 
have led to ambiguity in previous studies. 
Insofar as this aim has been realized, these 
results give no evidence of motivational 
effects on duration thresholds under com- 
parable conditions of cognitive expectancy. 
The Present findings, and those of other 
experiments, may perhaps be reconciled with 
the Concept of perceptual defense on the 
assumption that perceptual defense occurs 
only within certain limiting conditions. So 
far, these conditions remain to be explored and 

Specified precisely, and one value of this ex- 
periment lies in suggesting what some of them 
may be. First, it is Possible that laboratory- 
manipulated €go-involvement, while desirable 
for Precision of control, may produce motiva- 
tion too weak to affect duration thresholds. 
Since the motivation was apparently strong 
enough to exert some effects on the pre- 
Tecognition guesses, one might infer that 
different minimum strengths and/or durations 
of motivation may be required to affect dif- 
ferent measures of perceptual distortion. 

_ it might be argued, second, that motiva- 
tional effects do not appear when some other, 
more influential variable such as cognitive set 
1S present to override them, On this view, the 
Present results could be interpreted as analo- 
gous to the findings of Postman and Schneider 
(12) on familiarity, from which they con- 
cluded that differences in value rank produce 
Significant differences in perceptual sensitivity 
to unfamiliar but not to highly familiar words- 

Finally, there is the question whether the 
findings on perceptual defense may vary as 4 
function of types of material used. Perhaps 
affect can become attached to some stimuli 
(e.g., words and pictures) more easily than to 
numbers. If so, the Psychological variables 
leading to Such differences among materials 
need to be determined, 

In any event, the present findings suggest 
that caution and further empirical qualifica- 
tions are in order before making generaliza- 
tions about the occurrence of perceptu 
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defense, and about the variables which limit 
it or with which it interacts, if and when it 
occurs. 


SUMMARY AND CONCLUSIONS 


An experiment testing the following three 
hypotheses was described: 

1. When Ss have just been given an auditory 
Presentation of a stimulus and have reason to 
expect that they may be given the same 
stimulus visually, their duration thresholds 
for such visual stimuli are lower than for 
otherwise comparable visual stimuli for which 
no such cognitive set has been established. 

2. When Ss desire to succeed in a task, 
their duration thresholds for visual stimuli 
indicating that they have failed are greater 
than their duration thresholds for otherwise 
comparable visual stimuli indicating that they 
have succeeded. These differences are expected 
to be greater than could be accounted for on 
the basis of cognitive set or expectancy alone. 

3. When Ss desire to succeed in a task, the 
content of their prerecognition guesses about 
visual stimuli which indicate that they have 
failed resembles the content of stimuli which, 
had they been presented to S, would have 
indicated success. This distortion of pre- 
recognition guesses is expected to be greater 
than could be accounted for on the basis of 
cognitive set or expectancy alone. 

The design attempted to control by experi- 
mental manipulation variables which have 
caused difficulty in previous experiments in this 
field, i.e., S’s attitude toward the stimulus 
materials, frequency of his past experience 
with perceiving the stimulus materials and 
pronouncing them, inherent readability of the 
stimulus forms and willingness to report 
honestly. 

The influence of cognitive set on duration 
thresholds was demonstrated (Hypothesis 1). 
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No evidence was found for the effects of 
motivation on duration thresholds (Hypothe- 
sis 2), but some evidence was found for the 
influence of motivation on the content of 
prerecognition guesses (Hypothesis 3). 
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reward from parents of children in the age range sam- 
pled. It was further intended that each situation Tepre- 
Sent an instance of behavior in one of the following six 
areas: 

1. Feeding and oral activities: e.g., “insists on eating 
with fingers,” “seldom finishes meal,” etc. 

_2. Cleanliness-toilet training: e.g., “frequently has 
toilet accidents because won’t interrupt play.” 

3. Sex: sexual curiosity, sexual and obscene remarks, 
handling genitals, body exposure. 

4. Aggression: Items involve insolence and impu- 
dence, hitting and pushing, temper tantrums, explicit 
disobedience, willful property damage. 

5. Dependence: These items attempt to depict be- 
havior suggesting that the child lacks self-reliance, is 
attempting to gain parental support, etc., e.g., “whines 
or begs for help unnecessarily.” 

6. Independence: This area includes situations that 
are intended to suggest that the child is showing initia- 
tive, curiosity, assertiveness, e.g., “insists on own way 
with other children,” “wanders away from home,” etc. 

In addition to the structured interview phase, each 
mother was asked to describe her three biggest problems 
concerning her child, to answer two open-end projec- 
tive-type questions, and to fill out a combined version 
of the short-form (TFI) scale (4) and F scale (1). The 
scores thus obtained (M, 3.70; SD, 1.09) provided the 
measure of authoritarian personality. The interviews 


took from 114 to 214 hours. 


RESULTS 


Authoritarianism was found to be unrelated 
to each of the following variables: age of the 
mother, age of the child, sex and birth order 
of the child, and number of children in the 
family. The behavior situations included in the 
interview schedule were quite comparable to 
and representative of the kinds of behavior 
problems reported by the Ss. 

The main results are described below with 
reference to three experimental predictions. 

1. The selection of nonlove-oriented discipli- 
nary techniques should be a positive function 
of maternal authoritarianism score. To test 
this prediction, a product-moment correlation 
was computed between each mother’s au- 
thoritarianism score and the number of situa- 
tions for which she selected nonlove-oriented 
techniques. The correlation thus obtained 
was .63 (p < .001). 

For a more detailed analysis, the distribu- 
tion of authoritarianism scores was divided 
into thirds, with cutoff scores for highs and 


ated F scale of 12 items was used; it 
4, 37, 42 from 

ined Items 4, 9, 13, 18, 21, 25, 26, 34, 37, 
Seg Ttem 44 from Form 60, and Item 55 from 
Form 78 of the original F scale (1). The TFI items were 
the 12 comprising the short rm presented by Levin- 


son and Huffman (4). 


3 An abbrevi 


233 


TABLE 1 
MEAN PERCENTAGE OF MOTHERS AT EACH LEVEL oF 
AUTHORITARIANISM WHO SELECT EACH 
TYPE OF DISCIPLINE 


Type of Behavior Area 
Discipline and Overall 
anes of | Inde] De- | 23 | peca- As- | Mean 
Authoritarian: | Dend-| pend-| 8.5 ing | SEX | gres- 
ym ence | ence | O sion 
Love 
High 30.33] 65.12] 59.52} 47.29] 70.26] 18.64] 48.53 
Medium 27.90] 69.52} 69.64] 58.93] 80.57] 23.14] 54.95 
Low 45.05] 84.76] 88.69] 83.31] 89.12} 42.52) 72.24 
Mean 34,43] 73.13] 72.62] 63.18] 79.98} 28.10} 
Nonlove 
High 60.55] 23.10] 33.93} 30.57] 20.62} 69.40) 39.69 
Medium 53.02] 16.45] 26.19] 20.48] 11.17] 47.57] 29.15 
Low 47.60] 7.36] 8.93] 8.81] 8.14] 41.88) 20.45 
Mean 53.72| 15.64] 23.02) 19.95] 13.31] 52.95 
“Ambiguous” 
High 9.12| 11.78} 55| 22.14| 9.12| 11.96] 11.78 
Medium 19.08| 14.03] 4.17] 20.59) 8.26| 29.29] 15.90 
Low 7.35| 7.88] 2.38] 7.88] 2.74] 15.60) 7.31 
Mean 11.85] 11.23] 4.36) 16.87] 6.71] 18.95 


lows falling at 4.1 and 3.2, respectively. The 
relationship between mothers’ authoritarian- 
ism and preferred disciplinary techniques 
may be observed in the right-hand marginal 
column of Table 1. The body of Table 1 lists 
the percentages of type of discipline selected 
by mothers of a given authoritarianism level, 
in the various behavioral areas. Mothers at all 
levels of authoritarianism in general selected 
more love-oriented than nonlove-oriented 
control measures. However, it will also be seen 
that as authoritarianism level increases, 
nonlove-oriented techniques are preferred 
increasingly, and that as authoritarianism 

ency to select love- 


level decreases, the tend k ‘ 
oriented techniques increases. Since practically 


all of the previous research concerning au- 
thoritarianism has employed only the extremes 
of the distribution, the confirmation of this 
prediction also lends support to the contention 
that authoritarianism is an intrinsic contin- 
uum, not merely a dichotomous or trichoto- 
mous aggregate of traits. j 

2. High authoritarians in comparison wiih 
low authoritarians, should select relatively more 
nonlove-oriented responses for the sex and ag- 
gression areas respectively, than for the re- 
maining areas. This prediction was based on 
the supposition of authoritarian personality 
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theory that children’s sexual and aggressive 
behavior are especially unacceptable to author- 
itarian parents, and especially subject to 
control by discipline. 

If nonlove-oriented measures are the 
characteristic disciplinary techniques of high 
authoritarian parents, and if children’s sexual 
and aggressive behavior is especially unac- 
ceptable to them, then it might be expected 
that authoritarian parents would select more 
nonlove-oriented responses for these areas 
than for the other areas. To test that portion 
of the prediction pertaining to aggression, 
each high and low authoritarian mother was 
given a score based on the differences between 
the percentages of nonlove-oriented responses 
in the aggression area and the combined 
value of the remaining areas (minus sex). 
The ¢ ratio of 1.8 between the mean scores of 
the highs and lows, was close to significance 
at the .05 level. 

The failure of the data to attain an ac- 
ceptable level of significance may reflect the 
arbitrary grouping of genotypically different 
items in the particular categories adopted, 
The behavior situations classed under aggres- 
sion included items differing markedly from 
each other in interpersonal significance, Per- 
haps behaviors embodying threat to authority 
relationships may be more germane to the 
authoritarian personality dimension than the 
particular situations chosen for inquiry. For 
example, one group of aggressive behaviors 
that may be inter 
involving threat to parental authority dis- 


criminated significantly between highs and 
lows in an individu: 


areas of problem behavior as 
finer analysis to yield s 
gories in terms of inte 

Not only did the dat: 
part of the prediction 
as Table 1 indicates, t 
highs and lows in th 


was impressed with 
ceptance which the 
cerning sexual manife: 


Many of the mothers indicated that their own 


mothers would have been shocked by such 
behavior, and would not have been so tolerant. 
While this suggests a cultural trend toward 
increased leniency in middle-class attitudes 
toward the expression of sex, it also is very 
likely in considerable part a reflection of the 
cultural sophistication of the sample. More- 
over, sexuality per se was generally not 
recognized nor accepted as intrinsic in the 
particular behavior, but the behavior was 
generally equated to intellectual curiosity, 
or otherwise rationalized as part of the child- 
hood learning process. 

As may be observed in Table 1, the class of 
behavior control techniques preferred varied 
with behavior area. To determine if these 
variations exceeded chance probability, anal- 
yses of variance were computed in which the 
six behavior areas and the three authoritarian- 
ism levels were the independent variables 
and the classes of disciplinary techniques 
were the dependent variables, The data sum- 
marizing the analysis of variance for nonlove- 
oriented disciplinary techniques are presented 
in Table 2. Since love and nonlove response 
frequencies are essentially mirror images of 
each other, the results of the analysis of 
variance for love-oriented discipline were 
Substantially the same. Since the distributions 
were heterogeneous in variance and form, 
the F ratios were interpreted as inflated. 
While the very large F ratio (137) suggests 
that it would be quite safe to assume that the 
main effect of behavior areas is significant, 
the comparatively small F ratio (2.2) for 
interaction effects does not warrant the con” 
clusion of a significant interaction, This 
finding corresponds to the nonsignificant 
results for Prediction 2. 

Inspection of Table 1 indicates that nonlove- 
oriented responses were selected most fre- 
quently in the independence and aggression 
areas, and least frequently in the sex and 
dependence areas, One possible explanation 
is that the independence and aggression areas 
depict behaviors that are more threatening to 
Parental control and authority. However, 
many of the mothers explained their choice 
of response to the independence items Te- 
ferring to certain behaviors as potentially 
dangerous to the child by indicating that 
threat to their child’s safety was the one time 
they did not hesitate to resort to physica 
Punishment. It might be surmised that 88- 
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À TABLE 2 
ANALYSIS OF VARIANCE OF PREFERENCE FOR NONLOVE- 
ORIENTED DISCIPLINARY TECHNIQUES 


Source df SS MS e PF 


Levels of authoritari- 2 46796.62 23398.31 33.52 .01 


anism 
Subjects within levels 
Behavior arcas 
Level X Area 
Area X Subjects within 
levels 


123 85864.80 698.09 
5 217209.28 43441.86 137.07 „01 
701.41 2.21 .05 
316.94 


10 7014.05 
615 194917.34 


gressive behavior on the part of the child 
might provoke parental aggressive retaliation. 
The most frequent reason advanced by the 
mothers for their reluctance to select nonlove- 
oriented responses for dependency behavior 
was fear that these techniques might cause 
the child to feel rejected. 

3. The low authoritarian’s description of her 
child-rearing philosophy and related altitudes 
should contain more altitude statements with the 
judged consequence of maintaining the child’s 
approach tendency to her, than should the high 
authoritarian’s. The high authoritarian’s “phi- 
losophy” is expected to be judged as more 
likely than the low authoritarian’s to produce 
and strengthen an avoidance tendency in the 
child, For the purpose of testing this predic- 
tion, each mother was asked the following 
open-end questions: “What is your philosophy 
of child-rearing?” and “What do you think 
are the most important things to take into 
consideration when dealing with child behavior 
problems?” To reduce the scope of the task of 
rating, only the protocols of the mothers with 
the 30 highest and lowest authoritarianism 
scores were used in the statistical test. These 
protocols were submitted to two additional 
judges who rated them in terms of the antici- 
pated effect the depicted behavior would have 
on the tendency of the child to approach or 
avoid.the mother. The protocols were in no 
way identified with regard to authoritarianism 
and, in fact, the judges were unacquainted 
with authoritarian theory or the hypotheses of 
the study. Each protocol was divided into 
meaningful, coherent idea units on a purely a 
priori basis, in an attempt to facilitate com- 
parability of ratings. The task of the judges 
was to sort each unit of the mother’s state- 
ments into an “approach”? category, Lë, 
behavior thought to strengthen the child’s 
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approach tendency; an “avoidance” category; 
or a “miscellaneous” category. Although the 
number of units in the protocols varied, the 
mean number of units of the highs was close 
to that of the lows. However, as a precaution 
and in an attempt to compensate for unequal 
length of protocols, the score for each protocol 
consisted of the summed approach rating 
minus the summed avoidance rating. Thus, 
the final score for each protocol is a discrepancy 
score and is assumed to represent the relative 
excess of approach over avoidance (or vice 
versa). 

Fairly high interrater agreement is reflected 
in the product-moment correlation of .72 for 
the lows and .73 for the highs. It was concluded 
that there was sufficient reliability in the 
judges’ ratings to warrant their combination 
in mean judgments. The mean discrepancy 
score of the lows was 36.1, SD, 7.6; that of the 
highs 30.5, SD, 6.42; each case representing 
an excess of approach over avoidance judg- 
ments. The ż test computed to determine the 
significance of the difference between the 
scores of high and low authoritarians yielded 
a value of 3.08, P < .01, in support of Pre- 
diction 3. 

The confirmation of this prediction suggests 
that the structured interview schedule has 
construct validity (2), and that the socializa- 
tion techniques differentially preferred by high 
and low authoritarians have conceptual 
significance in terms of approach and avoid- 
ance principles. 

‘An attempt was also made to determine 
if maternal child-rearing preferences varied 
significantly as a function of such variables as 
sex and age of the child, and if interaction 
between either of these variables and authori- 
tarianism could account for differences in the 
preferences. The child’s age range was arbi- 
trarily divided into three groups: 214-34; 
344-414; 4445, and three analyses of vari- 
ance were carried out, in which the three age 
ranges and three levels of authoritarianism 
were the independent variables, and the 
three disciplinary techniques were succes- 
sively the dependent variables. Disciplinary 
techniques varied significantly as a function 
of age only in the case of “ambiguous” re- 
sponses. There was a slight, but significant, 
tendency for “ambiguous” responses to be 
selected increasingly by all three groups, as 
child’s age increased (F ratio 3.9, P <..05). 
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In the case of nonlove-oriented ee eni 
but only then, child-rearing pans 
varied significantly as a function of chil ae 
All three authoritarian groups — 
slightly more often with noni oeio Ae 
responses when the a oys, na 

7 g l ratio 5.9, .05). 
poe tey ete variable, there was 
= Cigntticant interaction between child’s sex 
and authoritarianism. 


DISCUSSION 


These findings emphasize a conclusion that 
many have long taken for granted. They 
show that the choice of behavior control 
techniques, in a variety of socialization 
situations, is to a significant extent conditioned 
by measurable personality factors in the 
mother. Whiting and Child (1) have argued 
that parental personality is one critical link 
in the mediation of culture from generation to 
generation. The present study, like theirs, was 
not designed to clarify the antecedents and 
consequents involved in this process, However, 
the results do suggest that different emphases 
or alternative customs of the culture are 


mediated as a function of maternal personality., 


results for 
depends 
assump- 
relative 
ishment 
ritarian- 
author- 
ritarian, 
voidance 
arianism, 
horitarian 


(and other au 


ore closely 
tendencies, 
toward superordinates 
coping techniques”—such 
he maintenance of social 


on between formal roles 
rather than perso 


ns as individuals, etc., —de- 
signed to manage this ambivalence, B 


many writers on authoritarian pers 
have contended, there remains an un 
resentment and hostility toward a 
that is revealed in such displaced {f 
scapegoating and prejudice. 

The speculative tenor of the above remarks 
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could be avoided if the child’s actual behavior 
toward and perception of his parents, and 
reaction to different forms of ee 
pressure, were determined by direct and 
indirect means in subsequent research. 


SUMMARY 


The primary purpose of this study was ee 
investigate the way in which the pries 
authoritarianism affects her selection i 
behavior control techniques in a variety 0 
socialization situations. Specific predictions 
were formulated in an attempt to combine 
the hypotheses and insights of authoritarian 
personality theory (1) with the socialization 
model suggested by Whiting and Child o 
One hundred twenty-six mothers of preschoo! 
children responded in a structured interview 
to 38 behavior situations which commonly 
arise and call for maternal decisions abou 
mode of control. These items were clasate 
as relating to feeding, cleanliness and toi m 
training, sex, aggression, dependence, ne 
independence; the preferred response of po 
mother could in turn be classified as love 
oriented, nonlove-oriented or “ambiguous. be 

The main results of the study may 
summarized as follows: s 

1. As predicted, authoritarians san ae 
consistent tendency to select more non gel 
oriented and fewer love-oriented aor 
than nonauthoritarians. When the m tion 
Tange authoritarians are included, inspec 
indicates the relationship is linear. i the 

2. There are significant variations 1n ted 
Proportion of love- and nonlove-orien jor 
responses selected as a function of the henan 
area involved. Thus, aggression and napa 
ence involve behaviors that provoke Ea 
nonlove-oriented responses, whereas i ae 
dependency behaviors call forth fewer nonlo 
oriented responses. A 

3. There is no significant interak a 
however, between authoritarianism and iori 
havior area. That is, in terms of the a p” HA 
classification of items into six different e ; 
authoritarianism does not discriminate aa a 
cantly better in one area than in anot ae 
It is quite possible, though, that E 
item arrangements (in terms more precisely 
relevant to authoritarianism) would show 
that the authoritarian’s relative preference for 


5 itioned 
nonlove responses is to some extent conditioni 
by the situation. 


x 


CHILD-REARING PRACTICES AND AUTHORITARIAN IDEOLOGY 


REFERENCES 


1. ADORNO, T. W., FRENKEL-BRuNSWIK, ELSE, LEVIN- 
son, D. J., & Sanrorp, R. N. The authoritarian 
personality. New York: Harper & Bros. 1950. 

2. CRONBACH, L. J., & MEENL, P. E. Construct validity 
in psychological tests. Psychol. Bull., 1955, 52, 
281-302. 

3. Hart, I. Maternal child-rearing practices and 
authoritarian ideology. Unpublished doctor’s 
dissertation, Duke Univer., 1956. 

4. Levinson, D. J., & Hurrsan, Puyrus E. Tradi- 
tional family ideology and its relation to per- 
sonality. J. Pers., 1955, 23, 251-273. 


237 


5. Luypquist, E. F. Design and analysis of experiments 
in psychology and education. Boston: Houghton 


Mifflin, 1956. 


6. Sears, R. R., WarTiNe, J. W. M., Nowuts, V., & 
SEARS, PAULINE S. Some child-rearing antecedents 


of aggression and dependency in young children. 


Genet. Psychol. Monogr., 1953, 47, 135 
& Cann, I. L. Child training and 


7. Wamo, W. M, 
personality: A cross-cultural study. 


Yale Univer. Press, 1953. 


Received November 12, 1956. 


-234. 


New Haven: 


CATHARSIS AND THE RELIEF OF HOSTILITY 
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ATHARSIS, in one form or another, 

occupies a central position in all 
C psychotherapeutic procedures. Corsini 
and Rosenberg (4), in reviewing the dynamics 
of group therapy, found that ventilation, or 
the release of feelings and expression of ideas 
usually repressed in other nontherapeutic 
situations, was one of the most frequently 
mentioned mechanisms in about 300 articles 
examined. Despite its importance, considerable 
confusion exists concerning the meaning and 
rationale for the efficacy of catharsis. Rotter 
(12) pointed out the many different ways in 
which catharsis is currently viewed and eval- 
uated the different techniques of attaining 
catharsis from a learning point of view. Dol- 
lard and Miller (5) emphasized the importance 
of the events following the expression of 
feelings or past memories rather than the 
sheer act of catharsis. The therapist as a 
nonpunitive agent helps to extinguish certain 
avoidance reactions or to develop new learning 
by pointing out associations not originally 
perceived by the patient. Alexander and 
French (1) attributed the therapeutic outcome 
following catharsis to insight or new learning 
rather than to any purging effect of the 
unconscious. New learnings may also occur 
merely as a result of new relationships per- 
ceived by the patient as he verbalizes previous 
experiences. 

Relatively few investigations have been 
undertaken to validate the clinical impression 
of the psychotherapeutic efficiency of catharsis 
or test any hypotheses as to its rationale. 
Haggard (7), using electric shock to induce 
disturbance, tested the relative efficacy of 
“catharsis-information” and experimental ex- 
tinction or “rest” on reducing the general 
Jevel of disturbance as measured by autonomic 
variables. Though he found that “catharsis- 
information” was significantly more effective, 
the design did not permit determination of 
whether “catharsis” or the straightforward 
“information” that shock would not be given 
was the essential variable. McKeachie e al. 
(8) found that students who were encouraged 
to write comments about their questions on 


an examination made higher scores than stu- 
dents who had conventional answer sheets. 
The subjects, however, who were instructed 
to write explanations made slightly, but not 
significantly, better scores than those who were 
instructed to write feelings. The results raised 
doubts concerning the efficacy of “blowing-off 
steam.” Wiener (13) investigated the relative 
effectiveness of two counseling techniques: 
reassurance-interpretation and catharsis-reflec- 
tion. Though the combined counseled groups 
showed consistent but not always significant 
improvement in performance in comparison 
with the noncounseled groups, there were no 
indications that either of the two experimental 
counseling techniques was more effective than 
the other. 

These ambiguous and contradictory results 
concerning the efficacy and the nature O 
effects of catharsis may be due to the unteste 
assumption that catharsis in whatever form 15 
equally applicable to all kinds of disturbances 
regardless of the underlying dynamics. It 15 
suggested, however, that catharsis is effective 
to the extent that it sets up conditions for 
reducing threat to need-satisfaction patterns: 
Conflict may arise from a need to express 
hostility or a threat to self-esteem or betwee? 
the moral values of the super-ego and the 
libidinal impulses of the id. Mere expression 
of anxiety feelings concerning an anticipate 
failure, for example, may not, in and of itse™ 
eliminate the underlying threat. An analysis 4 
the source of conflict in the expression °” 
hostility suggests certain hypotheses CO” 
cerning the role of various responses in t e 
reduction of the conflict. The goal of the 
hostility drive is attack and removal of th 
stimulus arousing hostility. The threat f 
such direct behavior, however, is the anti -A 
pated punishment derived from earlier ve 
periences in expressing hostility in this eur y 
It is hypothesized, therefore, that hostii A 
will be reduced if the therapeutic situatio 
provides socially acceptable conditions the 
the expression of hostility or attack at 
instigator. 


r, 
There has been some question (12), howev™” 


238 


CATHARSIS AND THE RELIEF OF HOSTILITY 


concerning the generalization of the relief 
of such feelings when expression of hostility 
is made to a third party (e.g., the therapist) 
or displaced upon some object or person 
other than the original instigator. On the 
one hand, expression directly to the instigator, 
even under nonpunitive conditions, may still 
arouse anxiety concerning the anticipated 
consequences of such attack. On the other 
hand, a number of experiments (9, 10) have 
indicated the generalization effect occurring 
under conditions of displacement. A second 
objective, therefore, of the present series of 
Studies is to compare the effect of expressing 
hostility directly to the instigator, and ex- 
pressing hostility indirectly to or about others. 


METHOD 


Subjects 


From nine sophomore and junior classes in elemen- 
tary and educational psychology, 52 subjects (Ss) 
were selected from each of the three largest groups, 
52 Ss from two smaller groups, and 52 Ss from the 
two smallest groups. The five groups were randomly 
assigned to the experimental treatments and were 
identified as follows: Group I, Catharsis-Experimenter 
(E); Group II, Catharsis-Assistant (A); Group ut 
Catharsis-Fantasy (F); Group IV, Control (Talk); 
Group V, Control (Silence). Sex was about evenly 
divided in all groups: in Group I, there were 23 women 
and 29 men; in Group II, 25 women and 27 men; in 
Group III, 23 women and 29 men; in Group IV, 28 
women and 24 men; and in Group V, 22 women and 


30 men. 


Experimental Tasks 

Basically, the design involved the arousal of hos- 
tility in groups of Ss, exposing them to different con- 
ditions of ventilating feelings, and then testing the 
efficiency of the experimental conditions in the reduc- 
tion of the disruptive effects of stress on performance. 
Hostility was aroused by an insult technique similar 
to the one used by Feshbach (6) with one modification; 
insult was directed at the performance of Ss in an 
ego-involved task, an intelligence test. In the present 
experiment, catharsis refers to the release of hostility 
feelings, verbally expressed by volunteers in a group, 
under nonpunitive conditions. The tasks used to 
assess the degree of the reduction of stress were a 
digit-symbol test and a test of incidental recall of the 
titles of a bogus intelligence test. The basic assumption 
underlying the use of performance measures as indices 
of the disruptive effects © 


f stress follows from the 
interference interpretation of the effects of frustration 


(2) by Child and Waterhouse (3): Responses such as 
anxiety and anger evoked by frustration in one activity 


may interfere with the quality of performance in a 


second activity- 
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Procedure 


To increase involvement, E introduced himself as 
professor of psychology and director of clinical psy- 
chology. Each group of Ss was told that as a part of 
the general routine of the Department of Psychology, 
students were given a series of tests measuring learning 
ability and intelligence. The scores were entered in 
each student’s file for purposes of guidance, recom- 
mendations for scholarships and jobs, and consideration 
for admission to graduate study if the students applied 


for such admission. 

Asa pre-experimental test of learning ability, each 
group was given a one-minute digit-symbol test, the 
code consisting of letters and geometrical figures. 
Then a bogus intelligence test (14) entitled “The 
University of Texas Intelligence Test”, and consisting 
of 14 subtests, each prominently labeled by a single 
word describing the contents, was administered to each 
group. The instructions, read verbally by E, indicated 
that this test consisted of a number of subtests, each 
highly related to academic success, and that the time 
limits of 45 to 90 seconds allowed sufficient time for 
most students to complete each subtest. At the begin- 
ning of each subtest, E said, “This test is called a test 
of _It is a measure of your ability to a 
Are you ready? Begin!” The time limits for the first 
two subtests permitted most of the students to com- 
plete them, but for the remaining 12 subtests, the 
material became much more difficult and the time for 
completion was sufficient for only a very few Ss. In 
order to arouse hostility feelings, Æ went around the 
room, critically remarking at the poor and slow per- 
formance of the Ss, urging more rapid solutions, and 
stating how much faster and better other classes were 
in this test. 

The conditions during the period of eight minutes 
following the intelligence test varied for each group 
depending upon the experimental situations involved. 
In Group I (Catharsis-E), Ss were informed that there 
would be a slight rest period before the next test and 
that during this time the examiner would like to know 
how they felt about the test and the administration 
of the test. For Group I (Catharsis-A), E said that he 
would have to go to his office to bring more test ma- 
terial but that a teaching assistant would come in to 
take over until his return. ‘The assistant, who had 
been instructed beforehand to profess ignorance con- 
cerning the events in the classroom, asked the group 
to inform him as to what had happened and to give 
him their reactions. The role of Æ or assistant for each 
group was merely to accept the statements without 
evaluation or comment except for a “hmm” or repeti- 
tion of the statement. ‘At the end of eight minutes, E 
walked in with more papers and continued the experi- 
ment. During the rest period for Group TIT (Cathar- 
sis-F), E mentioned that the university had surveyed 
student opinion a year ago concerning “irritations at 
the campus.” As a follow-up on the previous survey, 
the class was asked to discuss their evaluation of the 
present situation. 

In order to control the effect of “just talking” as 
against actually expressing hostility, a “neutral” topic 
was assigned for discussion in Group IV (Control- 
Talk). There had been recent publicity concerning the 
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shortage of science majors. The students were asked 
to suggest methods of encouraging more students in 
high schools to major in science. Finally, E lectured 
to Group V (Control-Silence) for eight minutes on the 
Preparation of majors in psychology, allowing neither 
“just talking” nor the expression of hostility. 

Following the “rest” period, each group was given 
another form of the digit-symbol test (numbers and 
different symbols). At the end of one minute, Ss were 
asked to write down the titles or brief descriptions of 
all the subtests in the intelligence test (incidental 
recall). All groups were tested within a two-day period 
to minimize communication to other Ss. 

When all five groups had been tested, E described 
the nature of the experiment and full procedure with 
particular emphasis on the function of the bogus in- 
telligence test to all Ss in order to relieve any doubt 
concerning their intellectual status, 


RESULTS 
Expressions of Hostility 


A graduate assistant, unaware of the ob- 
jectives of the present experiment, sat with 
the Ss in each group and made notes of their 
remarks during the experimental session 
Though it is difficult to assign a global rating 
of the intensity of expression for the group 
as a whole, the nature of the remarks indicated 
rather clearly that reports of expressed 
hostility were more frequent and intense in 
Groups II and TIT than in Group I. Even 
though Ss in Group I were encouraged to 
express their feelings, they gave mild criti- 
cisms only of the test. They suggested that 
the time limits were too short, E did not 
give sufficient time for them to read instruc- 
tions, the items were too difficult, and the 
test was not a valid measure of their intelli- 
gence. One might infer that hostility, ig 
Present, was directed toward the test. I 
Group II, Ss talked more directly about F 
to the assistant. They said he “talked 
E “he was too cocky,” “who did pa 
think he was,” “what à 

; the hell was his name,” 


n TABLE 1 

ADJUSTED POST-TREATMENT 
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“if he kept his mouth shut they could have 
done better.” Group III was very critical 
of staff members and teaching assistants: 
“Professors rewrote their books every year 
so that students have to buy new textbooks,” 
“professors were often unprepared for their 
lectures,” “some had no business to teach,” 
‘why don’t professors stay in their offices 
to see students,” “teaching assistants gave 
unfair exams and didn’t know any more than 
the students.” Ss seemed able to report more 
hostile feelings about the E to somebody else 
than to the Æ directly. When they had to 
express their feelings directly to Æ in Group 
It, however, they expressed more hostility 
about other authority figures (professors and 
assistants) than about E himself, In the control 
Situation of Group IV, Ss suggested methods 
of encouraging more science majors such as 
increased publicity, more use of audiovisual 
techniques in science classes, and emphasis 
upon applications of science courses. In addi- 
tion, however, there were a few critical 
remarks on “poor science teachers” an 
inadequate pay for good teachers.” 


Learnin g 


It was predicted that hostility feelings would 
be reduced under conditions encouraging 
expression of attack or hostility. Specifically, 
therefore, scores on the digit-symbol test 
Should be higher for the three experimenta 
than for the two control conditions. i 

For the digit-symbol test, Table 1 shows 
the Pre-experimental and postexperimenta 
means for all five groups. Since the Pr 
experimental means differed in the five 
groups, an analysis of covariance was applied- 
The adjusted mean scores are shown in the 
last column, 

Since an obtained F ratio of 11.43 for the 
treatment variable was significant well beyo™ 
the .01 level, tests of significance among ¢ x 
five groups of Ss were performed and at 
reported in Table 2. Since adjusted mik 
digit-symbol scores for Groups I, II, and 1 
are significantly higher (.01 level) than those 
for Groups IV and V, the hypothesis see™® 
clearly supported. There is no significa? 
difference, however, among the adjusted means 
of Groups I, II, and IIT. Nevertheless, GrouP 
IV, which merely discussed a neutral top! 
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7 TABLE 2 
os OF SIGNIFICANCE BETWEEN THE ADJUSTED 
MEANS OF THE DIGIT-SYMBOL SCORES FOR THE 
Five Groups 


Groups Adjusted t Ratio P 
Mean Difi. 

I and II .82 — 

I and II 113 18 = 

I and IV 2545 2.43 -01-.02 

Tand V 4.28 4.84 <.01 
II and III .69 -18 = 
a and IV 2.97 3.36 <.01 
a and V 5.10 5.76 <.01 

I and IV 2.28 2.58 <.01 
II and V 4.41 4.98 <.01 
IV and V 2.13 2.41 .01-.02 

TABLE 3 


Tue OF SIGNIFICANCE BETWEEN THE MEANS OF THE 
IVE GROUPS ON THE POST-TREATMENT TEST 
or INCIDENTAL RECALL 


Groups Xb t Ratio P 
I and II 1.21 3.36 <.01 
I and III 77 2.14 02-.05 
I and IV .68 1.95 .05 
I and V .85 2.36 01-.02 
II and III 44 1.22 — 
II and IV 1.89 5.22 <.01 
II and V 2.06 5.72 <.01 
III and IV 1.45 4.00 <.01 
III and V 1.62 4.50 <.01 
IV and V 17 50 — 


ficantly better (.01-.02 level) than 


did signi 
tained silence. 


Group V, where the Ss main 


Incidental Recall 

The results of the test for incidental recall 
not only confirm the findings on the digit- 
symbol test of significantly superior per- 
formance by the three experimental over 
the two control groups, but also point to a 
significantly superior performance by Groups 
II and III over Group I. An obtained F ratio 
of 12.48 was significant beyond the 01 level 
for the treatment variable. As shown in Table 
3, the mean incidental recall scores (number 


of titles recalled) of 6.50 and 6.06 for Groups 


II and Ul, respectively, are significantly 
higher than the mean of 5.29 for Group I. 
All three mean scores for the experimental 
groups are significantly higher than those of 
4.61 and 4.44 for the two control groups 
(Groups IV and V, respectively). No difference 
between the mean scores of Groups II and II 
or between the two control groups was 
demonstrated. Thus, incidental recall follow- 
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ing expression of hostility to the instigator 
about other authority figures or following 
expression of hostility about the instigator 
to another authority figure seems to have a 
significantly more facilitative effect on per- 
formance than the direct expression of hostility 


to the instigator. 


Discussion 


If the learning and incidental recall indices 
can be accepted as indirect indicators of 
the effects of the relief of aggressive tension, 
then the results clearly indicate (a) the 
therapeutic effects of catharsis on the relief 
of aggressive tension, and (b) the differential 
effects of varying cathartic conditions. It 
may be that hostility was relieved by ventila- 
tion or that expression of hostility about Æ 
or other similar figures in the cathartic situa- 
tion permitted a reduction of hostility feel- 
ings towards Æ in the postexperimental test 
situation (generalization). Certain limitations 
in the present experiment, however, should 
be recognized. Since groups rather than 
individuals were employed in the study, 
relatively few Ss reported their feelings 
during the brief cathartic period. It may be 
that the silent members relieved hostility 
feelings vicariously as others expressed ag- 
gression, or the effect may have been largely 
restricted to the more verbal members. The 
experiment warrants replication so that the 
differential effects on the verbal and silent 
members may be studied. 

‘Another factor to be considered in interpret- 
ing the results is the brief and “single-shot” 
nature of the cathartic period. It is possible 
that hostility would be more extensively 
relieved in Group I (direct expression to Æ) if 
a number of cathartic sessions were used. 
A number of trials might be necessary for 
Ss in Group I before the free expression of 
hostility feelings could occur. Though the 
results indicate that hostility feelings were 
reduced as much if not more in Groups II 
and III than in I, nothing can be said as to 
the duration or generalization of these feel- 
ings (postexperimental sessions). Experiments 
on displacement (9, 10, 11) suggest that 
the approach gradient would rise for the 
original goal after approach to similar stimuli. 
Brief encounters with Ss in the corridors 
after the testing of the initial groups and 
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prior to the explanation of the experiment 
suggest that hostility was still strong towards 
E. In the present experiment, the conditions 
in Groups II and III might be conceived as 
involving displacement of hostility feelings. 
In one case, the object of attack, E, was 
kept constant, but the person to whom the 
feelings were expressed varied from E (Group 
T) to the assistant (Group II). In the other 
case, the feelings were expressed to the same 
person, E, but the object of attack varied 
from E (Group I) to other staff members 
(Group II). Traditionally, the latter case 
is more appropriately an example of dis- 
placement since the object of attack is similar 
but not identical to the original instigator. 
The former case, more like the therapeutic 
situation, may nevertheless be considered an 
example of displacement since the punitive 
agent to whom hostility is expressed differs 
from the original. 

One final consideration involves the role 
of E in the two control groups. E had to be 
present in order to equate the condition of 
his presence in Groups I and II, but a more 
complete design to test the effect of the as- 
sistant in Group II should include a replication 
of the two control groups with the assistant 
rather than Æ in charge of the eight-minute 
control period. It is possible that the lecture 
by £ in Group V might have heightened 
temporarily any feelings of hostility, or that 
discussion concerning increasing the number 
of science majors in Group IV may have 
reduced hostility feelings through some critical 
comments about poor teaching. Either situa- 
tion, the lecture by Z in Group V or the discus- 
sion of science majors in Group Iv, might 
explain the significantly better results for 
Group IV on the digit-symbol test, 


SUMMARY 


The purpose of the present study was to 
study the effect of different cathartic tech- 
niques on the relief of hostility feelings, From 
nine sophomore and junior classes in ele- 
mentary and educational psychology, 52 Ss 
were selected from each of the three largest 
groups, 52 Ss from two smaller groups, and 
52 Ss from the two smallest groups. All 
groups were given a bogus intelligence test 
consisting of 14 subtests to produce ego- 
involvement. Hostility was aroused by the 
use of “insult” during the administration of 
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the tests. During the “cathartic” sessions of 
eight minutes, Group I was asked to express 
criticism of the administration of the test. 
For Group II, E mentioned that he was going 
to his office for other test material and that 
an assistant would take over for the next 
few minutes. The assistant professed go 
of the proceedings and asked Ss to descri s 
the previous events and state their Haon 
to them. Group III was requested to ge 
follow-up evaluation of a study made a Taol 
ago on campus irritations. Group IV, a ga al” 
group, was asked to discuss a “neutr ot 
topic, and V, another control group, p 
permitted to say anything while Æ eA 
on the preparation of majors for ig spans 
A pre- and post-treatment diga 5 
test and an incidental recall test of t® test 
titles of the subtests of the intelligence tie 
were used as measures of the effect 
relief of hostility feelings. The results W 
follows: ple to 

1. Ss in Groups IT and IIT were APS jn 
express more hostile feelings than reeling’ 
Group I, who were asked to express 
about Æ directly to him. 

2. Adjusted digit-symbol scores, 
nificantly higher for the three 
groups (I, II, III) than for the two 
groups (IV, V). 

3. There were no significant 1 
in the adjusted mean digit-sym>° 


re as 


among the three experimental a 1 
4. On incidental recall, Ss m han all 

and III were significantly higher 

other groups. her 


‘ hig 
5. Ss in Group I gave significantly . the 
incidental recall scores than those 
Control groups (IV and V). 
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HIS paper presents an experiment on 

reactions to communication and on 

attitude changes by individuals whose 
initial stands on a controversial social issue 
diverged in varying degrees from positions 
advocated in communication. Study of the 
relationship between subject’s (S’s) attitude 
and the position advocated in communication 
may help resolve some apparently contradic- 
tory effects of communication aimed at chang- 
ing attitudes. 

Attempts to change attitudes in the direction 
advocated by communication on a social 
issue at times produce shifts in the direction 
opposite to that intended—the “boomerang 
effect.” While numerous investigators have 
reported shifts of average test scores in the 
direction of communication (4), a fairly com- 
mon finding, even in these studies, is that 
some individuals shift their stand away from 
that presented in communication. Several 
studies reporting both positive and negative 
shifts in attitudes toward out-groups following 
communication are summarized by Williams 
(20). y ; — 

Thus, at times, persuasive communication 
produces a bi-modal distribution of attitude 
scores (11, pp. 874-875). For example, Rem- 
mers (12) obtained positive shifts on average 
scores following communications on conserva- 
tion, social insurance and labor unions, but 
the latter communication “sharply divided 
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the group into two opposing tents” (12, p. 201). 
In Knower’s study prior to repeal of prohibi- 
tion, “wet” communication to generally “dry” 
Ss and “dry” communication to generally 
“wet” Ss resulted in shifts in both positive and 
negative directions in each group (8). A rather 
striking instance of such opposing effects was 
reported by Wilke (18), whose antiwar commu- 
nication was presented at a time when the stu- 
dent population from which Ss were drawn was 
divided in controversy over this very issue. 

A few studies, such as those by Manske 
(10) and Russell and Robertson (14), have 
found group shifts in the direction opposite 
to the stand presented in communication. A 
related finding reported by Williams and 
Remmers (19) following communication on a 
rural issue was reduced variability and a 
comparatively less favorable stand by a group 
of rural youth in contrast to the increased 
variability in an urban group. Some authors 
who have obtained results in the direction 


away from communication insert the eng’ 
tion that too great divergence between », 
4 5 ted in 


stand on the issue and the stand presen 
communication may have been responsible. 
On the other hand, data from several an 
studies suggest that the extent of influen¢ 
increases as a function of the distance — 
position of communication and position of sr 
recipient. Goldberg (2) reports that the goa 
the discrepancy between an announced ae 
norm and the S’s own judgment the ee 
the change produced. In a still unpublis 
paper, French and Gyr (summarized 1 de- 
found a positive correlation between t iictee 
gree of deviation between inducer andin a 
and the amount of change. In a oo | ah 
task, Lubin and Fisher (9) ghen ment 
increase in conformity to a partner’s Ju g here 
as the distance increased up to & point move? 
with great distance the proportion, pi - (6) 
ment declined. Hovland and Pritz T in 
have shown that the larger the erene 
opinion advocated the greater the am! 
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change produced. The issues that were em- 
ployed did not pertain to strongly rooted 
attitudes, but represented opinions on rather 
factual topics. The communicators used were 
authorities respected by the recipients. 

A comprehensive analysis may be necessary 
to handle the above findings as well as results 
pce Mos sh communication on 
stn sial social issues which suggest that 
T great distance between S’s attitude and 

position advocated in communication 
produces “boomerang effects.” One possible 
approach is through analysis of processes 
underlying judgments of motivationally neu- 
tral material, exemplified by psychophysical 
stimuli, as well as ego-involving verbal ma- 
terial, exemplified by controversial social 
Issues. Investigations of judgmental processes 
and their theoretical relevance for the study 
of reaction to communication and attitude 
change will be presented in a forthcoming 
volume. 

One set of findings suggests that S’s position 
shifts toward the stand advocated in com- 
munication when the topic is not highly 
ego-involving and S’s position does not diverge 
in the extreme from the stand advocated. 
This result seems akin to extension of an 
established reference scale in judgment of 
weights or inclinations following introduction 
of anchoring stimuli near the end stimulus of 
the series, as reported by Rogers (13), Heintz 
(3), and others, Sherif, Taub, and Hovland 
(16) demonstrated this extension toward the 
anchor (‘assimilation effect”) as well as the 
effect of more remote anchors beyond either 
end of the stimulus series in constricting the 


reference scale of judgment and displacing 
(“contrast 


Judgment away from the anchor 

effect”), Contrast phenomena may also appear 

in reaction to communication on an Issue 

Which is not highly involving for S if the 

Position advocated is removed sufficiently from 
’s Position, as the Lubin and Fisher results 


Seem to indicate 
_ The above analysis based on findings oa 
Jj gment studies suggested the tine 
a. á established attituc’ 

meade prot troversial 


and are personally involved in a COPD o as 
Social issue, their “own stand” function: 


€ major anchorage affecting reaction to an 
evaluation of communication. In this case, 
Communication near S’s stand would be 
Assimilated to it, while communication at 
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variance with S’s own stand would be displaced 
still further away (“contrast effect”). Whether 
assimilation or contrast effects appear would 
be a function of the relative distance between 
S’s own stand and the position of communica- 
tion. 

Accordingly, in the present experiment 
communications representing two opposite 
extremes and one moderate position on an 
ego-involving issue were presented to Ss whose 
initial stands on the issue ranged from one 
extreme to the other. The following hypotheses 
were formulated in terms of the effect of the 
relative distance between S’s own stand and 
the position advocated upon evaluation and 
placement of communication as well as ac- 
ceptance-rejection of that position: 

1. Reactions to a communication will 
decrease in favorableness as the distance 
between S’s own stand and the position ad- 
vocated in the communication increases. 

2. In evaluations by S of what position is 
advocated by a communication, the greater 
the distance between S’s own stand and the 
position advocated in the communication, the 
greater the displacement away from S’s posi- 
tion (“contrast effect”). When only a small 
discrepancy in position exists there will be a 
tendency for displacement foward S’s stand 
(“assimilation effect”). 

3. With small distances between the posi- 
tion of the communication and that of the 
S, changes in 5S’s opinion in the direction 
advocated by the communication will occur. 
With large distances between the stands taken 

communication and by S, opinion changes 


by i 
in the direction advocated will be infrequent. 


METHOD 


Issue and Communications 
The problem required that a controversial issue be 
selected, that Ss’ stands on the controversy be ascer- 
ications advocating various 


tained, and that communi 1 dvoc: 
positions be presented to Ss with differing stands on 


the issue. 
‘The issue chosen was the controversy over prohibi- 
tion and repeal in a “dry” state. Shortly before the 
study began, a referendum was held to determine the 
i ing prohibition laws. The vote favored 


fate of exist 3 
prohibition by a narrow margin. i 
to differentiate existing stands on the issue, 


In order s 
representative statements made during the referendum 
Ilected from leading newspapers in 


campaign were CO! ed s 
two large cities. In addition, statements were obtained 
from 500 people in several localities on a random basis. 


These statements from public and private sources 
were sorted by 20 judges to secure clearly differentiated 


O raa 
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stands actually taken in the state. As a result, eight 
statements were chosen representing prevailing stands 
ranging from strong advocacy of prohibition to strong 
advocacy of repeal. One additional “wet” statement 
was added as a logical counterpart of the most extreme 
“dry” stand, giving a total of nine statements. These 
statements are as follows, (I) being the additional 
item: 

(A) Since alcohol is the curse of mankind, the sale 
and use of alcohol, including light beer, should be 
completely abolished. 

(B) Since alcohol is the main cause of corruption 
in public life, lawlessness, and immoral acts, its sale 
and use should be prohibited. 

(C) Since it is hard to stop at a reasonable modera- 
tion point in the use of alcohol, it is safer to discourage 

use. 

i (D) Alcohol should not be sold or used except as a 
remedy for snake bites, cramps, colds, fainting, and 
other aches and pains. , 

(E) The arguments in favor and against the sale 
and use of alcohol are nearly equal. 

(F) The sale of alcohol should be so regulated that 
it is available in limited quantities for special occasions. 

(G) The sale and use of alcohol should be permitted 
with proper state controls, so that the revenue from 
taxation may be used for the betterment of schools, 
highways, and other state institutions. 

(H) Since prohibition is a major cause of corrup- 
tion in public life, lawlessness, immoral acts, and juve- 
nile delinquency, the sale and use of alcohol should be 
legalized. , 

(I) It has become evident that man cannot get 
along without alcohol; therefore, there should be no 
restriction whatsoever on its sale and use, 

Three communications of equal length were pre- 
pared, each requiring approximately 15 minutes for 
delivery. Arguments were those actually made by 
prohibition and repeal advocates during the referendum 
campaign. Arguments in the three communications 
were arranged in the same order, but from the view- 
point of the different parties to the controversy. One 
communication presented an extreme “dry” stand; 
one an extreme “wet” stand; and one a moderately 
wet stand, as typified in statement (F) above. All 
communications were presented by tape recording. 
The same voice was used in recording wet and dry 
communications. 


Procedure 


In the first session, data on Ss’ attitudes were ob- 
tained. At the time, Ss were not told that an additional 
session would be held. A schedule on “public issues” 
was presented with assurance of anonymity to Ss. 
Following a “dummy” issue (college football) Ss re- 
sponded to the nine representative statements on 
prohibition. The following instructions were printed 
on the schedule and read aloud by the experimenter 
(E): 

Below are some statements recently made con- 
cerning the wet-dry issue in this region. 

Please read all of the statements carefully first 
before making any marks on this page. 

Now that you have carefully read all the state- 
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ments, underline the one statement that comes 
closest to your own point of view on the topic. 

There may be other statement or statements 
which you find not objectionable from your point 
of view. Put a circle around the letter in front of 
such a statement or statements which are not ob- 
jectionable to you. 

Now cross out that one statement which is most 
objectionable from your point of view, 

There may be other st: 
which you find objectionable from your point of view. 
Cross out the letter in front of sucha statement or 
statements which are objectionable to you. 

These procedures yielded data on S's stand and 
also on the range of his toler; 
the series (latitude of acceptan 
he rejected (latitude of rejec 
which he did not consider 


atement or statements 


first session, a communica- 
tion was presented by tape recording. Arrangements 
for its presentation were made through a member of 
the group being exposed and it was introduced as a 
talk actually made by a Proponent of the stand ad- 
vocated. The wet (repeal) communication was pre- 
sented to extreme dry Ss and unselected Ss. The dry 
(prohibition) communication was Presented to ex- 
treme wet and unselected Ss. The moderate communica- 
tion was presented to wet, dry, and unselected Ss, 
The Ss participated in both “before” and “after”? 
sessions in small groups of 10-30 under close super- 
vision. 

Following the communication, the same question- 
naire for securing S’s stand, latitudes of acceptance 
and rejection was filled in a second time. Reactions 
to the communication presented were obtained through 
ratings on like-dislike, teasonable-unreasonable, biased- 
unbiased, propaganda-fact dimensions. The Ss in the 
moderate communication groups also checked on a 
graphic rating scale ranging from extreme dry (A) 
to extreme wet (I) Positions what they thought to be 
the stand taken in the communication itself, 


Subjects 


r A 
of cases Personally known to the Es or their. the basis 


ir assi 
25 wet Ss were secured. For Comparison, 290 additions} 
Ss were obtained Tepresenting more moderate eaea 
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COMMUNICATIONS FOR Ss HOLDING VARIOUS POSITIONS ON PROHIBITION (BASED ON MEAN 
ACCEPTABLE STATEMENT) 


(Positions of communications indicated by arrow) 


on the issue. These were college students secured 
from classes in journalism, speech, education, chem- 
istry, etc. All Ss were residents of the dry state where 
prohibition was a lively topic of controversy at the 
time of the study. For the reason indicated, it was 
not possible to match the age levels of Ss in the three 
categories, 


RESULTS 


Evaluation of the Communication 

A five-item scale was used to measure the 
audience’s evaluation of the fairness and 
impartiality of the communication. In Figure 
1, the percentages of Ss in each group whose 
reactions were favorable are graphed for the 
three communications. On the abscissa the 
stands of the recipient (mean of statements 
checked acceptable) are presented and on the 
ordinate the percentage of favorable evalua- 
tions. It will be observed that there is an 
extremely close relationship between the 
individual’s own stand on the issue and his 
evaluation of communication. The two com- 
munications advocating extreme positions have 
their peak of favorable responses among 
those holding corresponding extreme positions. 
The maximum favorable reaction for the 
moderate communication is found among 
those holding a moderate position. The data 
provide quantitative information to support 
the expectation from earlier studies (5, 17) 
that individuals who are in favor of the opinion 
advocated will consider the communication 
fair and unbiased, but that those with an 
OPposed stand will regard an identical com- 
munication as propagandistic and unfair. 


Placement of the Position of the Communication 


The principal results of the experiment are 
those pertaining to the recipient’s perception 
of the stand advocated in the communication. 
We asked S to indicate on a graphic scale 
ranging from extreme dry position to extreme 
wet position, what he thought to be the 
position of the moderately wet communication 
(at F). From our previous study on assimila- 
tion-contrast effects with psychophysical data 
we predicted that positions differing only 
slightly from one’s own would be “assimi- 
lated,” while larger differences between one’s 
own position and that of the communication 
would be exaggerated, showing a “contrast” 
effect. In Figure 2, results concerning this 
prediction are presented. The S’s own position 
is indicated on the abscissa and along the ordi- 
nate the average rating of the position of the 
communication is given for the speech in which 
a moderately wet position (about F) was advo- 
cated. The dots indicate the mean placement 
of the position of the communication for Ss 
who indicate each particular position as their 
own stand. The squares represent the mean 
placement of the position advocated when S’s 
position is estimated from the mean of the 
acceptable positions checked. The dotted line 
indicates a hypothetical relationship in which 
Ss holding the same position as the com- 
munication report its position accurately (at 
F), those a small distance removed assimilate 
it to their own position, and those still further 
removed exaggerate the position revealing a 
contrast effect. 
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The expectation is fulfilled that those at the 
wet end judge that the communication ad- 
vocated a drier position than it did and those 
at the dry end judge that it was advocating a 
wetter position than it did. Those nearer the 
position of the communication reported it 
more objectively. The evidence concerning 
assimilation deviates somewhat from the 
theoretical curve. There were too few Ss with 
intermediate positions to enable us to deter- 
mine these positions adequately. 


Acceplance-Rejection of Stands 


Each S was asked to indicate not only the 
stand which best represented his, but also 
other stands which were more or less accept- 
able (i.e., his latitude of acceptance), the 
stand most clearly opposed to his position, and 
others not acceptable (i.e., his latitude of 
rejection). From these data, we find that Ss 
with more extreme positions tend to reject 
more frequently positions not within their 
latitude of acceptance. The Ss with middle 
positions are more apt to rate positions re- 
moved from their own as indifferent. In 
Table 1, data are presented showing the mean 
number of items rated “also acceptable,” 
“unacceptable” and “neither acceptable nor 
unacceptable” by Ss with strong positions 
(A, B, G, H, I) as compared with those by Ss 
with more moderate positions (C, D, E, F). 
It will be observed that a significantly greater 
number of items are judged unacceptable by 
Ss with extreme stands. Smaller differences 
between the groups exist in the number rated 
acceptable. These results tie in with our 
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TABLE 1 


ACCEPTABILITY OF STATEMENTS IN RELATION TO 
EXTREMITY OF SUBJECTS’ POSITION ON ISSUE 


Mean | Mean Mean 


Number of Number of| Number of 


S's Positions N |. Items [Items Not) Ttems 
| Acceptable} Checked Rejected 
1. Extreme (A, | 193 2.81 1.48 4.71 
B, G, H, 1) 
2. Intermediate 37 | 3.05 2.24 3.70 
(C, D, E, F) 
$ <.03 


earlier studies (7, 15) in which we found that 
Ss with extreme positions and considerable 
ego-involvement in an issue have raised thresh- 
olds of acceptance and lowered thresh- 
olds of rejection in placing items concerning 
that issue, resulting in smaller latitudes of 
acceptance and greater latitudes of rejection 
than for Ss with moderate positions. These 
findings can be used in constructing the 
expected latitudes of acceptance and rejection 
of the individual for an ego-involving issue 
once his own position is ascertained and pos- 
sibly in predicting evaluation of communi- 
cation which stands at a given distance from 
his own position. An attempt in this direction 
is presented in the discussion. 


Changes in Opinion Jor Groups with Diferent 
Stands on the Issue 


Finally, it would be expected on the basis 
of the considerations previously discussed 
that those holding a position at great variance 
with that being presented in the communica- 
tion would be relatively little modified in 
their opinion. The index of opinion change was 
based on the mean position of the statements 
checked by Ss as being “acceptable” to them. 
A value of 1 indicates the extreme dry end in 
which only statement A was checked. Cor- 
tespondingly, a value of 9 is at the wet end 
with only statement I checked. The positions 
of the three speeches at B, F, and H would 
thus be represented by values of 2, 6, and 8, 
respectively. 

Table 2 gives the mean position for five 
criterion groups before and after the com- 
munication, together with the average amount 
of change. Only the unselected group exposed 
to the wet communication should be expected 
to show sizeable change. The four other 
groups all have initial positions quite removed 
from the position being advocated. While the 
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TABLE 2 
OPINION CHANGE 


Changes in Mean Acceptable Statement for Ss 
with Differing Initial Stands 


Change in 


5 Before After Direction 
Group N Comm. Comm. of Comm. 
Wet Communication (H) 
Drys 69 2.39 2.34 —.05* 
Unselected 92 5.10 5.65 -+.55* 
Dry Communication (B) 
Wets 25 6.70 6.74 —;04 
nselected 87 5.90 5.78 +.14 
Moderately Wet Communication (F) 
Drys 1142.17 2.26 +.09 


* Difference between changes: 
b = <.03 (one tail), 


TABLE 3 
OPINION CHANGE 


Percentage of Ss Changing i in Direction of 
_ Communication or in uORpoRed Direction 


| Change 
Change | in Di- 
in | rection J 
G ks | No aR Net 
šroup N Dires) Change k Change 
Comm. to 
Comm. 


| l 
Wet Communication (H) 


% % | .% % 
Drys 69 | 27-5 | 49.3 | 23.2] +4.5* 
Unselected 92 | 52.2 | 23.9 | 23.9 | +28.3* 
Dry Communication (B) 
Wets 25 | 24.0 | 56.0 | 20.0 +4.0 
Unselected 87 | 40.2 | 33.4 | 26.4 | +13.8 


Moderately Wet Communication (F) 
114 | 31.6 | 49.1 | 19.3 | +12. 


i) 


Drys 


* Difference between changes: 
p = <.04 (one tail). 


dry and unselected groups exposed to the 
wet communication do differ substantially in 
their initial positions, the positions of the 
wet and unselected groups are similar, since 
the unselected group (college students) ini- 
tially holds a rather wet position. 

As predicted, there is a significantly greater 
change for the unselected group given the 
Wet communication than for the dry group 
(p = <.03). As in all studies in which the 
Variable with which we are concerned cannot 
be directly manipulated, the possibility must 
be considered that there are factors other 
than position on the issue which are correlated 
With it and provide the basis for the relation- 
Ship obtained. 


In Table 3, results are presented for the 
same groups in terms of the percentage of Ss 
who are influenced either in the direction 
advocated by the communication or in the 
direction opposed to it. The net change column 
represents the differences between those 
changed in the direction advocated and those 
changed in the opposite direction. The net 
change is greater among the unselected group 
than among the drys when a wet communica- 
tion is presented (p = <.04). Twenty-eight 
per cent of the former and only four per cent 
of the latter changed in the direction advo- 
cated. It will also be seen that the amount of 
change produced among the drys by a mod- 
erately wet communication (at F) is greater 
than that produced by the more extreme 
wet communication (at H). The net proportion 
changed by the former is 12 per cent, while 
it is only 4 per cent for the latter. This differ- 
ence, however, is not statistically significant 
and, in any case, it is difficult to establish 
that the two different communications were 
inherently equivalent in persuasiveness. 

It will be noted that by confining our com- 
parisons to groups who were initially selected 
as having known differences in their stand on 
the issue (either wet or dry) we were able to 
avoid the regression effects often found in 
studies where amounts of change are com- 
pared for groups differentiated solely on the 
basis of initial scores on the attitude measure, 
Under the latter conditions, those with more 
extreme scores often regress toward the mean 
and this can either obscure the effect or pro- 
duce a relationship artifactually. 

Finally, it will be noted from Table 3 that 
the predominant response among Ss holding 
extreme positions is to remain uninfluenced 
by the communication. Even where Ss in these 
groups were influenced, changes were seldom 
found in the item they checked as best repre- 
senting their own position. Instead, there 
was typically only an increase in the number 
of other statements intermediate between 
their own position and that of the communica- 
tion which they would check as “also accept- 
able.” 


DISCUSSION 


The results presented lend support to the 
three hypotheses pertaining to evaluation 
and placement of communication and attitude 
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TABLE 4 
HYPOTHETICAL LATITUDES oF ACCEPTANCE AND 
REJECTION oF Ss Hotpinc Eacn POSITION 
(Columns show latitude of acceptance (strongly ac- 
cept plus accept), latitude of rejection (strongly reject 
plus reject) and Positions neither acceptable nor unac- 
ceptable to Ss holding given positions.) 


Own Position 

Rating 
Poaha B CD E F G HI 
V 0 X X XX XX XX XX 
3 ware tS x x x x 
c 0 v¥wWwv 0 xX xX xX X 
Dk 8 ¥ wi wo me & 
ÉE xX X 0 ¥Wv¥0x xX 
Ex KK 0 ygyd X 
G x KK e OG Y wiv D 
wn x x O x OM Ke O yw ae y 
I XX XX XX XX X xX 0 Vv W 


Code: Vv = strongly accept; v = accept; 0 = neither accept 
nor reject; X = reject; XX = strongly reject. 
Mean frequencies in above table: 


A,B,G,H,I C,D,E,F 
2.7 3.0 
vat vv es FA 
X and XX 5.0 4.0 


change as a function of the relative distance 
between S’s stand and the position of com- 
munication on an ego-involving issue. Our 
findings suggest that the relative distance 
between Ss’ attitudes and Position of com- 
munication may be useful in explaining 
apparently contradictory effects of communi- 
cation in producing attitude change in the 
intended direction, no change, and change in 
the opposite direction. This approach seems 
to have predictive value for S’s Teactions to 
other positions on an issue and susceptibility 
to change once his own stand is ascertained. i 

On the basis of results in Table 1 and evi- 
dence from previous studies (7, 15), we can 
formalize the pattern of acceptance and Te- 
jection for various positions by Ss holding 
each initial position. In Table 4, the expected 
response to each opinion item is given for Ss 
checking positions from A to I as their most 
acceptable stand. The S’s own stand is indi- 
cated by vv, “also acceptable with y, 
completely unacceptable with XX, other 
stands unacceptable with X, and not clearly 
acceptable or unacceptable with 0. Thus, the 
table presents a hypothetical pattern for the 
“latitude of acceptance” (consisting of S’s 
own position and other acceptable Positions) 
and “latitude of rejection” (consisting of all 
unacceptable positions) for Ss holding each 
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position. Where sufficient numbers of Ss are 
available for a given stand, the empirical 
distributions of responses show a close cor- 
respondence to these hypothetical patterns. 

This model for latitudes of acceptance and 
rejection can be used to explicate results on 
reaction to communication. By comparison 
with Figure 2, showing placement of the 
position advocated in the moderate com- 
munication, it can be seen that “contrast 
effects” begin to appear when the position 
advocated falls within S’s latitude of rejection, 
an area far removed from his own stand. 
If the position advocated in communication 
falls at the limits or slightly beyond the 
latitude of acceptance, it would be more 
likely to be assimilated to the latitude of 
acceptance, although this trend is less clear 
in the present study. 

If we assume that evaluation of communi- 
cation on an issue in which S is personally 
involved depends on two factors: (a) what the 
Position of communications is perceived to be 
and (4) what segment of the acceptance- 
Tejection range is associated with the perceived 
position, we can predict from Fig. 2 and Table 
4 the degree of favorableness of evaluations of 
the communication (Fig. 1). When the com- 
munication falls within the latitude of ac- 
ceptance (V or Vv), it is judged fair and 
unbiased. As the perceived position of com- 
munication moves outside the latitude of 
acceptance into the latitude of rejection (X), 
it is increasingly considered unfair and propa- 
gandistic. For example, an individual checking 
B as his most acceptable stand displaces the 
F communication to G (“contrast effect”) 
which, according to Table 4, i 
latitude of rejection 3 an S checking A as his 
own stand places the F communication be- 
tween G-H, still further toward the extreme 
end of the latitude of rejection, and the per- 
centage of favorable evaluations of the com- 
munication declines still further (cf. Fig. 1), 

Two general problems are Suggested by the 
present analysis. Since S’s stands on an issue 
of personal concern produce variation in reac- 
tion to communication and attitude change, 
the effect of the degree of e€go-involvement in 
issues needs exploration. It is expected that 
latitudes of acceptance and rejection will 
vary as a function of the degree of €go-involve- 
ment, and these variations may affect the 
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ranges in which assimilation and contrast 
effects in judging communication occur. A 
person strongly committed on an issue will be 
more discriminating in accepting stands 
(raised threshold of acceptance) while one who 
is less committed will be willing to consider a 
larger number of alternative positions (lowered 
threshold of acceptance). Corresponding effects 
on extent of opinion change with different 
distances between S and communication are to 
be expected. With low involvement issues one 
would anticipate increase in opinion change 
with increased separation over a considerable 
Tange, whereas with high involvement issues 
one would expect opinion change over only a 
narrow range of separation, with resistance to 
acceptance of the communication for the 
remaining distances. 

Finally, an interesting problem remains as 
to the role of the communicator in influencing 
the range of assimilation. In the Hovland- 
Pritzker study (6) the communicator was an 
authority respected by Ss on the issue pre- 
sented. There the greater the distance between 
Communicator position and that of S the 
greater the opinion change. In the present 
study, on the other hand, the communicator 
was an anonymous individual whose accepta- 
bility might be determined by the stand taken 
by him on the issue under discussion. If he 
differed in position only slightly from S he 
would be regarded as fair and authoritative and 
bring about a shift in opinion, but if he differed 
greatly he would be regarded as incompetent 
and biased and fail to influence S’s opinion. 
To explore this factor and the one of involve- 
ment mentioned in the preceding paragraph, 
a factorially designed experiment will be 
required in which type of communicator 
(positive, negative, and neutral), type of issue 
(high involvement and low involvement), 
and distance between the communicator’s 
Stand and S’s own stand, are all systematically 
varied to permit analysis not only of the main 
effects but also of the more interesting inter- 
action effects. 


SUMMARY 
The relationship between the attitude of the 
Tecipient and the position advocated in a 
Communication was studied under conditions 
Where a communicator not known to S presents 


a point of view on a controversial issue which 
differs from that of S by varying amounts. 

The topic discussed was prohibition of 
alcohol. The Ss came from a dry state where 
this was a lively issue. Each S indicated for a 
series of nine statements ranging from extreme 
dry to extreme wet: (a) the position most 
acceptable to him, (b) other acceptable 
positions, (c) the position most objectionable 
to him, and (d) other objectionable positions. 
Groups of Ss were selected whose stands on the 
issue were known (e.g., WCTU members) 
for comparison with unselected groups of 
college students. In all, 514 Ss were used. In 
a subsequent session “wet” groups received a 
“dry” or “moderately wet” communication; 
“dry” groups received a “wet” or “moderately 
wet” communication; unselected groups re- 
ceived a “dry,” “wet,” or “moderately wet” 
communication. After the communication, 
opinion measurements and reactions to 
communication were obtained. The findings 
were as follows: 

1. When the distance between S’s own 
stand and the position advocated in communi- 
cation is small, the communication is judged 
favorably, as fair and factual. With increasing 
distance, the favorable reaction is sharply 
reduced and the communication perceived as 
propagandistic and unfair. 

2. The Ss whose own stands diverge widely 
from the position advocated perceive the com- 
munication as further removed from their own 
stand than it is (“contrast effect”). The present 
results indicate, though less clearly, that Ss 
whose own stands are close to the position 
advocated perceive the communication as 
closer to their own stand than is the case 
(“assimilation effect”). 

3. The most frequent result for Ss whose 
own stand diverges widely from that advocated 
in communication is to remain unchanged in 
their initial attitudes. More Ss with moderate 
positions closer to the stand in communication 
changed in the direction advocated. 

It is suggested that the relative distance be- 
tween S’s own attitude and communication 
along with S’s latitudes of acceptance and re- 
jection for various stands on the issue may pro- 
vide a basis for predicting reactions to com- 
munication and susceptibility to change. Fur- 
ther research varying degree of ego-involve- 
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ment in issues and attitudes toward the 
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communicator is suggested. 
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COMMONSENSE observation is the 

starting point of our inquiry. If one 

were to enter a room and be intro- 
duced to a Frenchman, a German, and an 
American, it would be likely that one’s first 
'Mpressions of the three would be such as to 
Magnify the nationality difference between our 
new acquaintances. The Frenchman might 
@ppear more “typically French,” the German 
more “typically German,” and the American 
More “typically American.” Our perceptual 
Selectivity would register cues that differentiate 
the three in terms of their most evident differ- 
ence: their nationality. To this commonsense 
observation we must add a second. It is also 
likely that, if one were an American, the 
Frenchman and German would seem more 
“typical” of their nationality than would the 
American to whom we had been introduced. 
And this would perhaps be the more so to the 
degree that we were well acquainted with 


* Americans but had experienced little contact 


With Frenchmen or Germans. 

The matter need not be limited to national- 
ity. Introduced at the same time to two men, 
one as a professor and one as a businessman 
of our own nationality, likely as not we would 
Seek the cues that permitted maximum differ- 
entiation of the two in terms of the difference 
of profession. And again, the degree to which 
this would be so might depend upon how well 
Acquainted we were with practitioners of the 
two professions: the better acquainted, the 
less the likelihood of role stereotyping. If we 
Were familiar with one of the professions, by 
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the same token, the other would undergo a 
stereotyping in our impression formation. 

These commonsense observations; or rather, 
the hypotheses to which they give rise 
and with the testing of which this paper is 
concerned, lead to several thoughts about the 
nature of the process that has come to be called 
“the formation of first impressions.” It suggests 
in the first place that under certain conditions, 
our impressions have the effect of accentuating 
the differences between people encountered 
briefly, and further, that there are other condi- 
tions that reduce this tendency to the accentua- 
tion of difference, Let us state the matter more 
formally in terms of a set of hypotheses. 

Hypothesis 1. If a person be placed in a 
particular category on first encounter (nation- 
ality, role, occupation), the more widespread 
one’s experience with diverse members of the 
category the less will this category member- 
ship affect the impression formed of the person 
encountered. Put concretely, if one knows 
Americans better than Frenchmen or Germans, 
the first impression formed of an American 
will be less determined by his nationality than 
in the other two cases. Or if one knows musi- 
cians better than theoretical physicists, the 
former occupation will affect a first impression 
less than the latter. 

Hypothesis 2. If three pieces of wood, alike 
in all respects save their grain, be viewed 
simultaneously, an observer will be more likely 
to notice grain characteristics in the wood than 
if one of the pieces is presented alone. Put more 
formally, that attribute which distinguishes 
an array of objects will be most salient in 
viewing the array. Given a single object, there 
are no similar constraints on what an individual 
may notice (3). With reference to our own 
experiment, if the individual is asked to form 
impressions of an array of individuals differing 
only in nationality, then nationality will be a 
more determinative factor in impression forma- 
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tion than if the individual is set the task of 
forming an impression of a single individual. 

With this much of an introduction, we may 
turn to the study designed to examine and test 
these hypotheses. 


DESIGN OF THE STUDY 


x The basic design of the study was simple in 
outline. By the use of a brief sketch, a stimulus 
person or stimulus persons were presented to 
our subjects (Ss). The Ss were then given the 
task of checking on a trait list those items they 
thought characteristic of the stimulus person 
as a “first impression.” The stimulus persons 
presented varied in terms of their nationality 
(French, German, or American) and in terms 
of their occupation—unspecified, college pro- 
fessor, or businessman. Moreover, the impres- 
sion formation situation varied, as required by 
our second hypothesis, in terms of whether the 
S had a single person of a particular national- 
ity in mind when forming his impression or had 
in mind three people of different nationality; 
what we shall call the “single-impression” 
and the “‘triple-impression” situations. After 
filling in the impression trait list, Ss were asked 
to indicate, in a manner to be described, the 
bases upon which they had decided to check 
various traits. The Ss were all university 
students and themselves varied in nationality, 
There were university students from Boston; 
Paris and Dijon, in France; and a set of groups 
from Berlin, Hanover, and Cologne, in 
Germany. Students in each country, then, ee 
forming an impression of a compatriot an (ia 
of a foreigner. We may consider now the 
ifics of the design. i 
ve groups of Ss were sige 
“single-impression” situation, a Here em 
Germans, a third French, and a thir me 
can. The total sample was comprised of 5 
Ss of whom 298 were male and 278 female The 
groups who were given the test while assembled, 
varied in size from 13 to 29 Ss. Each $ in these 
groups, as we have noted, formed an impression 
of a single stimulus person. The twenty-seven 
groupsand the number of Ss in each is set forth 
in Table 1. Since no consistent sex differences 
were found, the data for both sexes were 
combined. i . 
Only nine triple-impression groups were 
required to fulfill the design. The total popula- 
tion in these nine groups comprised 218 Ss 


TABLE I 


THE NUMBER OF SUBJECTS IN EACH 
OF TWENTY-SEVEN SINGLE-ĪMPRESSION GROUPS 


| Stimulus persons who are— 


American French German 
Subjects al 
s 
| Ë]g E| s jg 
El 2lZlzlaleleial# 
; S|FlSlealalsleleis 
American 19 | 13 | 21 | 15 | 18 | 22 | 46 | 18 | 19 
French 29 | 22 | 18 | 29 | 28 | 27 | 28 | 28 | 27 
German 20 | 20 | 20 | 20 | 20 | 19 | 20 | 20 | 20 
TABLE 2 


THE NuMBER oF SUBJECTS 1N Eacn or NINE 
TRIPLE-IMPRESSION Groups 


Stimulus persons who are— 


Subjects Amer., Fr., Amer., Fr., Amer., Fr, 
& Ger. & Ger. & Ger., i 
Profs Bus. men Unspec. 
American 26 28 30 
French 26 25 23 
German 20 20 20 


of whom 108 were males and 110 females, 
Each S formed impressions of three stimulus 
persons of different nationality: in a third of 
the groups the stimulus persons were a French, 
American, and German professor; in another 
third, French, American, and German business- 
men; and the remaining third, a French, 
American, and German person of unspecified 
occupation. The design and the number of Ss 
in each group is presented in Table 2. These Sx 
knew in advance, of course, that they were to 
form impressions of people of three different 
nationalities. 

All Ss were given a booklet with the instruc- 
tions not to look ahead in the questionnaire 
and to finish each page before turning to the 
next. Both single-impression groups and triple- 
impression groups were told: “You will be 
asked to form impressions of specific persons in 
this questionnaire.” The latter were further 
instructed: “These people are from different 
countries.” 

The instructions and description of each 
stimulus person were as follows: 


The object of this test is to determine the extent to 
which people are capable of judging a person from @ 
few facts about that person. A brief characterization 
of a specific individual appears below. Read it care- 
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TABLE 3 
Tue Trait List AND Irs GERMAN AND FRENCH EQUIVALENTS 


Synonyms for “Energetic” 


Focused Diffuse 
Trait German French Trait German French 
resolute entschieden résolu bustling geschiiftig agité 
amelo zäh tenace animated angeregt animé 
dogged, verbohrt persévérant spirited munter fougueux 
determined entschlossen décidé vivacious lebhaft vif 
Synonyms for “Intelligent” 
Focused Diffuse 
Trait German French Trait German French 
analytic analytisch analytique wise weise sage 
vrilliant geistreich brillant judicious abwägend judicieux 
giek schnell alerte level-headed einsichtig bien équilibré 
ogical — logisch rationnel 
penetrating durchdringend perspicace prudent klug prudent 
astute scharfsinnig astucieux sensible vernünftig sensé 
clever E gescheit adroit sound gesund doué de bon sens 
discerning scharfblickend doué de discer- unaffected ungeziert simple 
nement 
canny schlau avisé sensitive empfindsam pourvu de flair 
shrewd listig sagace 
cunning verschlagen ruse tactful taktvoll diplomate 
calculating berechnend calculant understanding verständnisvol comprehensif 
d’avance 
unprejudiced vorurteilsfrei impartial 
Synonyms for ‘‘Well-adjusted” 
Focused Diffuse 
Trait German French Trait German French 
self-possessed gelassen maitre de lui cheerful frélich joyeux 
happy glücklich heureux 
philosophical philosophisch philosophe satisfied zufrieden ae de bon 
sor 
deep tief profond 
fully and try to form an image of this person. You will French, and German) those which “best 


be asked to record your impressions. 

He is a very typical There is general 
agreement among those who know him that he is 
intelligent, energetic, and well-adjusted. Now 42 years 
old, he is married and lives in a large city in a 


For the American businessman, the S 


Would be told: 


“He is a very typical American businessman . . . who 

lives in a large city in the United States.” For the 
French college professor, the subjects would be told: 
“He is a very typical French college professor... who 
lives in a large city in France,” etc. 


Next, Ss were asked to turn the page and 
choose from a list of 38 traits (see Table 3, 
Which contains the trait names in English, 


characterized the person described.” The Ss 
were allowed to choose as many as they wished. 

After the trait list had been checked (in the 
case of the triple-impression group, after trait 
lists for all three stimulus persons had been 
checked), Ss were next asked the bases on 
which they had checked specific traits on the 
list. A series of letters were used to describe 
these determinanis of each trait-choice. For 
each trait checked, Ss wrote: 


E if the trait energetic of the stimulus person con- 
tributed to that choice, 

I if the trait intelligent of the stimulus person 
contributed to that choice, 
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W if the trait well-adjusted of the stimulus person 
contributed to that choice, : 

V if the social role contributed to that choice, 

N if the nationality of the stimulus person con- 
tributed to that choice. 


Subjects could use any combination of the 
letters to describe completely the bases for 
each choice. They were encouraged to specify 
freely any other determinants of their choice, 
although few actually did. 

Since we shall rely heavily upon the report 
of Ss concerning the determinants of their 
choice of traits, a word about the “meaning” 
of such reports is necessary. One cannot 
naively assume that Ss “know” what led them 
to check a particular trait. It is as conceivable 
that the degree of one’s hostility toward a 
businessman-father is as much a determinant of 
choice as anything contained in the sketch. 
But it would be just as naive to assume out of 
hand that Ss, university students in this case, 
are completely incapable of “knowing” the 
basis for their selection of traits. The issue is 
not one that can be resolved. Yet, we have 
asked our Ss to indicate the basis of their 
choices. We take their responses as sympto- 
matic not of the “true” basis of choice—what- 
ever that may mean ontologically—but rather 
as a basis for inferring what underlies their 
choice. In fact, the only proper basis of infer- 
ence would be further systematic variation in 
the trait lists used, in the instructions given Ss, 
and in the nature of the responses they gave 
by which we make our inferences about causes. 
We shall go ahead in this paper treating our 
Ss’ reports on determinants as if they could be 
taken as a proper basis of inference. In a final 
section we shall reconsider the matter. 


Design of the Trait List 


The traits used to characterize the stimulus 
persons were constructed in the following way, 
the activity being conducted in English: (a) A 
list of synonyms of the stimulus traits 
“energetic,” “intelligent,” and “well-adjusted” 
were gathered; (b) synonyms were divided into 
dichotomized groupings on the basis of certain 
dimensions. (See Table 3.) Synonyms of the 
stimulus trait “energetic,” for example, were 
subdivided into “focused-energetic” and “dif- 
fuse-energetic.”” Under “focused-energetic” we 
put the traits: bustling, animated, spirited, 
and vivacious. These dichotomies were based 
on hunches concerning shifts in the meaning 
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of stimulus traits when connected with persons 
of various nationalities. 

The final ordering of traits in the list was 
random. Recall that Ss were asked to report 
the determinants of each trait they checked. 
With the exception of two traits all traits on 
the list were found to be determined by the 
stimulus traits for which they were designed 
to be synonyms. So Ss checking “sensible” 
on the trait list would more often report that 
the “intelligent” characteristic of the stimulus 
person determined their choice rather than the 
fact that he was said to be “well-adjusted” or 
“energetic.” 

In both France and Germany the researchers 
were given the American trait list and were 
asked to translate this list into their respective 
languages. A number of problems arose. First, 
it was difficult to establish an exact lexical 
equivalent because either the language did not 
utilize the stimulus traits or the check list 
traits in the same way, or the lexical equivalent 
was not used with the same degree of fre- 
quency in each country according to the judg- 
ment of the respective researchers, In all cases, 
the list was translated by not less than three 
bilingual individuals native to the country, 
and a final discussion was held to decide which 
translation of the equivalent trait in the 
American trait list was to be used, - 

There is no practicable procedure for assur- 
ing that the trait lists in the three languages 
are either denotatively or connotively identical, 
and the results to be reported must be 
evaluated in the light of this fact. A close 
approximation to a test of identity is by the 
use of a matching procedure. We have asked 
English-German and English-French bilinguals 
to match items in the two lists to ascertaiù 
whether, within this universe of traits, identity 
matches could be obtained. But this does not 
obviate the contingency that the items 
matched across languages are not the closes! 
Synonyms possible. All that we cansay is that we 
have taken precautions, Fora fuller account 0 
the Problems of translating traits from onë 
language into another, the reader is referre 
to the reports of Perlmutter, Mayntz, a” 
Hurtig, (6) and Lenneberg and Roberts (4): 
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will the single-impression Ss. A gross test of 
the hypothesis is provided by comparing the 
number of times on the average that Ss in 
the two situations indicated that their choice 
of a trait for characterizing the person was 
based upon knowledge of nationality. Group- 
Ing together all French, American, and German 
stimulus persons without regard to occupation 
and comparing the average number of times 
that Ss justified their choice of a trait by 
reference to nationality, we obtained the con- 
firmatory results presented in Table 4. 

Another way of stating these results is by 
reference to comparisons between specific 
single- and triple-impression groups. There are 
27 single-impression groups as indicated in 
Table 1. For each, there corresponds a compar- 
able record given under triple-impression con- 
ditions. In all 27 of the possible comparisons, 
the triple-impression group shows a greater 
reliance on nationality as a determinant of their 
tesponses, a result of a confidence level well 
beyond the .01 level as determined by the 
Dixon-Mood Sign Test. 

Table 4 also confirms our other hypothesis 
that, in general, one uses nationality deter- 
minants less often in forming one’s image of a 
compatriot than in forming an impression of a 
foreigner. This result was obtained in both 
single-impression and triple-impression situa- 
tions. Comparisons may be made in this way. 
Take first the single-impression situation. For 
each group of Ss, let us compare the use of 
nationality as a determinant in forming an 
impression of a compatriot and of a foreigner. 
Thus, we will compare the use of nationality 
determinants for Americans judging Americans 
with their use of this determinant in judging 
Frenchmen and in judging Germans. We thus 
have two comparisons. We may also compare 
the use of nationality when Americans form an 
image of an American professor as compared 
with their usage in forming an impression of a 
German professor and a French professor. For 
Ss of all nationalities, 18 such comparisons 
are possible for the single-impression situation 
and another 18 for the triple-impression pro- 
cedure. In 34 out of the 36 comparisons thus 
afforded, nationality is used more frequently 
as a determinant in forming an impression of a 
foreigner than of a compatriot, again a statis- 
tically highly reliable result. 

. Table 4 indicates, however, that under 
Single-impression conditions, this tendency to 
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TABLE 4 
AVERAGE NUMBER OF Tines Tuar Trair Is 
CueEcKED ON Basis OF NATIONALITY 
OF STIMULUS PERSON 


French German American 

| Subjects Subjects Subjects 

Group | 5 | 2 5 | E 5 = 
||| $| 2/8/38 

LEI el El eli 

a =z]; O}e/o 
Sing.-Impress. | 4.6 | 2.8 | 3.9 | 1.4 | 3.3) 1.3 
Trip.-Impress. | 6.8 | 6.8 | 6.6 | 4.5 | 5.2) 4.3 


use nationality more often on foreigners than 
on compatriots is considerably more marked 
than under triple-impression conditions? The 
latter conditions seem to have the effect of 
making nationality salient even in sizing up a 
fellow national. We may note, for example, 
that with the French Ss operating under 
triple-impression conditions, nationality deter- 
minants are used as often on compatriots as on 
foreigners. It was in this subgroup that the 
two reversals in trend noted above were found. 

We come now to several corollaries of the 
major hypothesis with which we have just 
been concerned. The first of these has to do 
with the specificity of the category into which 
a stimulus person is “coded.” Recall that Ss 
were given sketches of a college professor, 
a businessman, and a person of unspecified 
vocation varying, of course, in nationality. 
Recall also that the Ss are themselves uni- 
versity students in working contact with pro- 
fessors perhaps more than with businessmen. 
It would follow then that the category ‘‘pro- 
fessors” would be more differentiated for them 
than the category ‘‘businessmen.” We would 
hypothesize, then, that vocation would more 
often be used as a determinant in forming 


? This tendency is statistically a highly reliable one. 
We test significance in the following way. Compare 
the difference of nationality determinants used in 
forming an image of a foreigner and of a compatriot 
under single- and triple-impression conditions. Eighteen 
such comparisons are possible. For example, for French 
Ss operating under single-impression conditions, we 
may compare the difference in number of nationality 
determinants used in forming an impression of a typical 
American professor and a typical French professor 
(3.00 — 2.10 = .90) with the difference obtained for 
the same stimulus persons judged under triple-im- 
pression conditions (6.54 — 5.96 = .58). In 17 of the 
18 comparisons possible, the differences were greater 
for the single-impression groups, a result significant 
beyond the .01 level. 
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TABLE 5 
AVERAGE NuMBER OF Toms Tuar Trait Is 
CHECKED ON Basis or VOCATION OF 
STIMULUS PERSON 
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an impression of a businessman than of a 
professor, at least amongst our university- 
based subjects. Such is indeed the case. 
Consider first the nine  single-impression 
groups. For American Ss, for example, we can 
ask whether vocation is more often used as a 
basis for forming an impression of an ‘‘Ameri- 
can businessman” or an “American professor.” 
We can also compare German professor and 
businessman and French professor and business- 
man for this group of Ss and for Ss of other 
nationalities. This gives us nine possible com- 
parisons. In all nine of these comparisons, 
the results come out as expected: vocation is 
more frequently named as a basis of checking 
list traits for businessman than for professor. 
These results are summarized in Table 5 where 
the material for the triple-impression condition 
is also presented. In the latter case, the picture 
is less clear: six of the nine comparisons are in 
the expected direction. If the 18 comparisons 
be grouped, the resultant figure of 15 in 18 is 
significant by the Sign test at the 01 level. If 
the reader examines the reversals in Table 5, 
he will note that they are more of the order of 
ties than of reversals in two of the three cases, 


Tue NATURE OF THE IMPRESSIONS FORMED 


Our object in the present report is to con- 
sider principally the nature of the impression- 
forming process and the factors that may infly- 
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ence this process. In the preceding section, the 
emphasis has been upon determinants, and in 
the one that follows it will be upon the facets of 
the images formed by our Ss as inferred from 
the traits they actually checked in character- 
izing the various stimulus persons presented. 

A full presentation of the traits ascribed to 
Americans, Frenchmen, and Germans of 
different occupation by respondents of these 
several nationalities would be a forbidding 
undertaking. There are 36 groups of Ss, each 
with a somewhat different kind of stimulus 
person, at least different in terms of conditions 
under which presented. And there are 38 
traits which may be checked. 

One approach is to choose a particularly 
interesting stimulus person—one whose image 
seems to loom large in the political and social 
sphere: the American businessman. We will 
limit our discussion to him as illustrative.3 

To reduce the complexity of the image that 
emerges, we limit ourselves to a discussion of 
those traits that are checked by at least 50 
per cent of respondents from a given country. 
These we may regard as “consensual” impres- 
sions. In Table 6 are set forth those traits in 
which such a consensus was found with respect 
to the image of the American businessman, All 
three nationalities, regardless of the method 
of testing, agree that the American businessman 
is “determined,” and there is also rather wide 
agreement that he is “sensible.” French re- 
spondents rather agree with their American 
fellows in seeing him as “satisfied” and “level 
headed”’—rather a sobersided image—whereas 
the Germans appear to emphasize a constella- 
tion of “resolute,” “tenacious,” and “calcu- 
lating.” It is difficult, of course, to extract an 
organized image from a set of checked items on 
a trait list; and also from the items that go 
unchecked. For as Asch has so wisely pointed 
out and demonstrated (2), traits fuse together 
In an organization, and the organization is 
often dominated by a single trait. We have no 
way of determining from our data, for example, 
whether “calculating” is the organizer of many 
German impressions, or “resolute” of French 
images. 

There is not a very sharp difference in the 
mages formed by different nationalities, a fair 
and considerably above-chance agreement 


3A detailed qualitative analysis of the German 
findings has been Published in Germany (6). In this 
study, individual differences in the formation of im- 
Pressions of foreigner and compatriot are discussed. 
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TABLE 6 
Traits AGREED Upon BY HALF OR More or SUBJECTS oF EACH NATIONALITY IN CHARACTERIZING 


AN AMERICAN BUSINESSMAN 


Single-Impression 


Triple-Impression 


Americans French German Americans French German 
cheerful joyeux cheerful joyeux 
tactful diplomate tactful 
Sound doué de bon sens 
Satisfied content de bon satisfied content de bon 
A sort sort 
logical rationnel logisch 
résolu entschieden résolu entschieden 
tenace zäh tenace zäh | 
5 simple ungeziert unaffected ungeziert 
b ustling geschäftig geschäftig 
sensible sensé vernünftig sensible 
ctermined décidé entschlossen determined décidé entschlossen 
clever clever adroit 
nappy happy heureux 
patural naturlich 
level-headed bien équilibré level-headed bien équilibré 
understanding 
pourvu de flair 
astucieux 
maître de lui maître de lui gelassen 
avisé avisé 
calculant calculant berechnend 
d’avance d’avance 
shrewd 
alerte schnell 
animé 
abwiigend 
scharfblickend 


being the rule for Americans, Germans, and 
Frenchmen checking a trait list for stimulus 
Persons of any nationality. We shall not go into 
detail on this matter, for it is a subject with 
little issue. Partly it is an artifact of synonymy: 
an American who is “intelligent” and ‘“well- 
adjusted” as presented ina sketch is more likely 
than not to be called “astute” by a fellow- 
American, ‘“‘scharfsinnig”’ by a German, and 
“astucieux” by a Frenchman. But, then, so is 
a German stimulus person so characterized. 


RECAPITULATION 


_ Our first and perhaps most general hypothesis 
18 that if objects that are alike in all respects 
Save one are considered together, their difference 
M this one respect will be more critical in the 
impression one forms of the objects. Three 
identical triplets, differing only in the color of 
Hg they are sporting, will be seen and inter- 
Preted more in terms of their tie-wearing habits 
than would be the case if each one were en- 
©ountered singly and without the possibility of 
a simultaneous comparison. A corollary of this 
see esis to which we have addressed our- 

Ves is: If in forming impressions of foreigners 


and compatriots one is thinking in a compara- 
tive context, with the different nationalities in 
mind while forming one’s impression, then the 
degree to which nationality will influence the 
impressions formed will be increased. The 
comparative context, often celebrated rather 
uncritically as a facilitator of panhuman 
perceptiveness, may not always have such an 
effect. In crosscultural contact, an individual 
may in fact exaggerate the impressions of 
differentness between foreigner and com- 
patriot. Indeed, anthropologists have often 
been criticized for their description of the gulf 
between members of different cultures, partic- 
ularly by those who, like Allport (1), have 
tried to emphasize the communalities in the 
human condition. It may indeed be the special 
comparative perspective of the anthropologist 
that produces this emphasis, if indeed it is 
such. 

Our second hypothesis is similarly simple. 
If a person or object be “placed” or classified 
in an undifferentiated category with the mem- 
bers of which one has had little experience, 
the effect will be for the general properties of 
the undifferentiated category to have a greater 
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effect on the impression formed of the individ- 
ual so placed than if the placement had been 
in a more differentiated category. Category 
membership, in brief, will have a more telling 
effect on the impression formed of one of its 
members in the degree to which the category 
is differentiated. More differentiated categories 
have less “stereotyping” effect than less differ- 
entiated ones. Thus, the nationality of a person 
will stereotype one’s impression of him to the 
degree that the nationality is well or poorly 
known to one. So too with occupation. A 
college professor will do little stereotyping on 
the basis of being told that a man he ismeeting 
teaches at a university. Should he know 
buinessmen less well, the announcement that 
the man in question earns his living in com- 
merce will have a greater effect on the impres- 
sion formed. To put the matter more elegantly, 
a differentiated category is one in which there 
is a better representation of the likelihood 
with which specific traits are associated with 
all or most of its members. 

Our two principal hypotheses leave much 
ground still unturned in the matter of how 
one forms impressions of compatriots and 
foreigners, but our findings do serve to sharpen 
up a few issues. Impression formation depends 
in massive degree on categorization processes, 
In essence, we place a person or thing ina 
category on the basis of a few minimal cues 
—like a statement of his nationality or 
occupation—and then proceed to “run off” 
along the lines of the higher probability at- 
tributes associated with people or events in- 
cluded in the category. As noted in a recent 
work of Bruner, Goodnow, and Austin (3), the 
attributes that will be emphasized under these 
circumstances are dictated at least in part by 
the nature of the discrimination that the im- 
pression-former has to make. If one should be 
asked for the characteristics of man that dis- 
tinguish him from bears, one set of attributes 
will become salient. If the task is to distinguish 
man from all other species, we may have re- 
course to the old characterization of man as a 
featherless biped. 

We turn at last to the methodological prob- 
lem introduced earlier. We have asked our Ss 
first to check certain traits that characterize a 
person presented in a sketch, and then to in- 
dicate the basis for doing so—what in the 
sketch prompted the choice of a given trait. 
Much of our reasoning has been based on the 
latter of these data: notably, the reliance on 
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nationality as a basis for checking traits. It is 
fashionable to be apologetic about utilizing 
such direct reports as a basis for drawing in- 
ferences. We do not feel that the matter can 
be resolved within the compass of the data 
available to us. This much is clear. Highly 
consistent differences have been found, differ- 
ences that have a congruency 
comparison to comparison. 
thought that their choice of traits was being 
determined by certain features of the sketches 
presented to them or they were in some degree 
so determined. The Proper skepticism requires 
that the matter be left open pending further 
tesearch—research designed to compare direct 
reporting with such other indirect methods of 
response as one may find in the armamentarium 
of projective tests or psychophysiological pro- 
cedures. Any one such procedure would be as 
suspect inferentially as the direct reports of our 
Ss, no more and no less so. Ideally, inferences 
should be drawn from a very wide sampling of 
different kinds of responses. Let the reader 
suspend judgment in the present case, or better 
still, let him consider whether at this stage of 
research it is better to commit a Type I ora 
Type II error. 

One final word is in order. Impression forma- 
tion is not a Separate sort of cognitive activity 
and we are in the debt of Asch (2) for making 
this clear. It is a phenomenon that requires 
close analysis in terms of cognitive theory— 
whether the theory be associationistic, Gestalt, 


or the type of information-utilization theory 
that has informed the prese; 


as one goes from 
Either our Ss 


nt enterprise. 
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CASE REPORT 


THE TREATMENT OF TWO PHOBIC PATIENTS ON THE BASIS OF 
LEARNING PRINCIPLES' 


VICTOR MEYER 
Institute of Psychiatry, University of London, Maudsley Hospital 


HIS paper is presented as an illustration 
of the possibility of applying experi- 
i mental method and some of the prin- 
ciples available in psychological theories of 
learning to the treatment of psychiatric symp- 
toms. As such, it is somewhat similar to an 
earlier study by Jones (7). It differs from the 
latter, however, in that it demonstrates the im- 
Portance of individual differences in person- 
ality structure and the manner in which such 
differences dictate modifications of therapeutic 
techniques. 

The treatments to be described were based 
mainly on the principle of primary stimulus 
generalisation (5). According to this principle, 
a conditioned reaction is not only evoked by 
the original conditioned stimulus but also, to 
Some degree, by a series of stimuli more or less 
Similar to the original one. The more dissimilar 
the new stimulus, the less intense is the condi- 
tioned reaction. 

Many phobic patients display anxiety so 
intense when presented with certain environ- 
mental situations that they are not amenable 
to any therapeutic approach. However, the 
generalisation principle suggests that the 
presentation of an environmental situation 
which resembles the original one but evokes a 
reduced amount of anxiety can be exploited 
therapeutically. In such a situation, one can 
attempt to substitute an adaptive normal 
reaction for the original unadaptive anxiety 
Tesponse. A graded continuum of similar situa- 
tions can then be employed, moving towards the 
Original one, so that the original unadaptive 
response can be eliminated. This is essentially 
One of the methods advocated by Jersild and 

olmes (6) for the treatment of children’s fear, 
and its theoretical basis has been elaborated by 
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such writers as Guthrie (4) and Wolpe (8). The 
latter, among others, has demonstrated the 
usefulness of these techniques in experiments 
on animals and has applied them successfully 
in the treatment of a series of psychiatric 
patients (9). 


SUMMARY OF CASE HISTORIES 


For the purposes of this paper, a complete 
case history of each patient is not essential. A 
brief summary of the psychiatric notes will 
suffice. 


Case 1 
The patient, a married woman of 48 years, 


was admitted to the hospital for investigation 
of suspected temporal lobe epilepsy. Apart 
from blackouts, she complained of an excessive 
fear of going out. 

No family history of psychiatric disorder or 
epilepsy was reported. The birth and develop- 
mental milestones seemed normal. She left 
school at the age of 15 and had a successful 
work history up to the time of her illness. Her 
marriage was fairly satisfactory, but there were 
no children. Before her illness, she was de- 
scribed as quick-tempered but a good mixer, 
cheerful, hardworking, and conscientious. 

Her blackouts started at the age of 28 during 
pregnancy. Since then, she had suffered 18 
blackouts. They all followed physical exertion 
or emotional upset and were usually preceded 
by an aura of dull sensation in the chest. She 
usually lost consciousness, fell limp, and hurt 
herself on several occasions. On recovery she 
vomited and felt “horrible.” 

At the age of 44 she began having peculiar 
“feelings.” These consisted of epigastric sensa- 
tions, followed by a “horrible” sensation of 
being about to fall. These episodes occurred 
frequently when she was about to go out or 
when she was outside on her own. Occasionally, 
when she was unable to control these feelings, 
she tended to panic and then to suffer one of 
her blackouts. 

Asa result of these symptoms, she developed 
a strong fear of going out on her own, of 
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traveling on public conveyances and of having 

sexual intercourse for fear of an attack. From 
the age of 46 she refused to go out unaccom- 
panied. Shortly before the admission to the hos- 
pital, she was occasionally unable to go out 
with friends or even with her husband. She 
gradually became more and more anxious, de- 
pressed (entertaining suicidal thoughts), and 
fearful. Losing confidence, she resigned from a 
responsible job and stopped caring for her 
home. She continued working until admission, 
but needed to be accompanied all the way to 
and from work. On admission she was pleasant, 
sociable, cooperative, rational, and well 
oriented. 

The results of all investigations were nega- 
tive with respect to temporal lobe epilepsy. 
Except for some mild abnormalities shown on 
the air-encephalogram, all the tests contrain- 
dicated organic involvement. In view of the 
possibility that the attacks might be of cardiac 
origin, she was referred to cardiologist. He 
could find no abnormalities in the cardio- 
vascular system. 

The patient was diagnosed as exhibiting a 
phobic state with blackouts of unknown 
nature and was referred to the Psychology 
Department for the symptomatic treatment 
of her phobias. 


Case 2 


A man of 42 years was admitted to the hos- 
pital for investigation of blackouts. He also 
suffered from excessive tension and various 
fears, mainly related to entering enclosed and 
crowded spaces. 

Two instances of psychiatric disorder were 
reported in the family history: His mother had 
a “nervous breakdown,” and his elder brother 
experienced difficulty in going out alone. The 
patient’s early development and childhood 
seemed normal. He left school at the age of 14 
and worked successfully as a precision grinder. 
He had been happily married and had two 
children. Before his illness he was described as 
undemonstrative and timid, but affectionate 
as a father and husband. 

At the age of 22 he had several fainting at- 
tacks on parades when serving in the armed 
forces. Following these, he developed episodes 
of apprehension which usually occurred in 
crowded places. He felt tense, flustered, and 

experienced an urge to run away in a panic. 
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Strange or crowded places and feelings of bore- 
dom were associated with the onset of these 
episodes. The presence of a friend and strong 
interest tended to prevent their development. 
Up to the age of 38, he could preserve control 
and avert panic. After that time, control be- 
came more tenuous. On four occasions, episodes 
of panic terminated in fainting attacks. As a 
result, he became worried about himself and 
sharply curtailed his social activities. Shortly 
before admission to the hospital, he had a 
feeling of apprehension in a barber’s shop, 
panicked, mounted his bicycle and shortly 
afterwards collapsed. After this accident he 
refused to ride the bicycle and would not go 
out. He became anxious, tense, and depressed. 
He was treated by his own physician with 
phenobarbitone with very little success. On 
examination for possible temporal lobe epi- 
lepsy, the results were negative. When ad- 
mitted to the hospital, he appeared very tense 
and anxious, but rational, well oriented, and 
cooperative. He was reluctant to talk about 
himself and needed much encouragement. 
Like the first case, he was referred to the Psy- 


chology Department for symptomatic treat- 
ment of a phobic state. 


TREATMENT 
Case 1 


The patient was interviewed twice with the 
aim of getting detailed information about her 
phobic symptoms. The rationale of the treat- 
ment was then explained and discussed with 
her. During this time the experimenter (£) 
endeavored to establish good rapport. Ac- 
cording to the theory outlined, this is essential 
since E becomes part of the various environ- 
mental situations during treatment and, by 
establishing an effective relationship, he be- 
comes a reassuring stimulus, tending to reduce 
anxiety. 

No exact record of each treatment session 
was kept and, therefore, some of the descrip- 
tion lacks precision. Since the most likely place 
for the abnormal behavior to occur was a door 
leading to the “outside” and the “outside” 
itself, the hospital roof garden, which has 
walls but no ceiling, was selected as a starting 
point for treatment. On the first occasion, the 
subject (S) entered the garden with Æ. She 
only reported mild “thumping in the stomach 
throughout this session. Next day she reporte 
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feeling perfectly well and volunteered to enter 
the garden on her own. When she had gone 
twice on her own without any signs of the 
symptoms recurring, the place of treatment was 
changed to the main hospital garden. She 
went out several times with Æ on successive 
days, then was required to go out on her own 
to meet £, and then to seek and find him. 
Since S appeared rather dependent on E’s 
Presence, other staff members of the ward 
were asked to participate in the treatment 
Without .S’s knowledge of prearrangement by 
E. She went out with various nurses and pa- 
tients for walks in the garden. Similar expedi- 
tions took place during the evening. Not more 
than two sessions (30 minutes each) were 
given each day. When asked to go into the 
garden on her own, she did without any diffi- 
culty. From that point, she was encouraged to 
take frequent walks into the garden during the 
daytime and the evenings. 

The next stage consisted of £’s taking walks 
with the patient outside the hospital grounds. 
These started in back streets and were grad- 
ually extended into the main street; the dis- 
tance covered increased gradually day by day. 
She also went out with other people. Even- 
tually, she was able to set out on her own to 
meet £ and others outside. She was taken out 
at least three times when feeling “upset.” She 
took four bus rides accompanied by £ and fre- 
quently volunteered to go shopping with 
other patients. After two weeks (16 sessions) of 
this treatment, she was able to make short ex- 
peditions on her own without difficulty. 

She spent the subsequent 16 days going for 
walks with E and other people and on her own. 
Short trips on buses were included, and at 
least three times she went out on her own in 
the evening. She spent three weekends at 
home but was instructed to go out on her own 
only if she felt confident. She managed to take 
three short walks near her house. On the last 
day of treatment she traveled some three miles 
away from the hospital and back. During the 
Whole treatment, she never reported any 
Symptoms except occasional mild thumping in 
the stomach. Her behavior on the ward im- 
Proved. According to the psychiatrist in charge 
and the nursing staff, she had very few “up- 
Sets” and was more cheerful. She felt confident 
and eager to be discharged. 

The treatment took about five weeks, and 


the patient claimed to have enjoyed it. When 
she was discharged, her relatives and friends 
were told that she should be encouraged to go 
out on her own provided she felt confident and 
not upset. It was planned that she should be 
seen once every two weeks in the outpatient 
department for follow-up purposes and, if 
necessary, to modify management. 34/54 

Follow-up. For nearly five months the pa- 
tient worked regularly and was unaccom- 
panied when visiting the hospital. She took 
occasional solitary walks and was able to do 
her shopping. Each working day she was met 
at the station by a friend who traveled with 
her to the factory; on the return journey she 
was met by her husband. She managed to get 
to the station on her own, and on two occasions 
when the friend failed to meet her, she experi- 
enced no difficulties. A month after discharge 
from the hospital, she developed an acute chest 
pain for which she was treated by her doctor. 
Also, while on a bus with some friends, she had 
one of her blackouts without any warning and 
was unconscious for about 45 minutes. Despite 
these difficulties and worries, she continued 
to work and to go out on her own. At the end 
of the fourth month, she increased the fre- 
quency of her unaccompanied expeditions, and 
for the first time in two years attended ade- 
quately to her housework. She also went to 
work on her own seven times in succession 
when her friend was on vacation. She reported 
occasional feelings of anxiety, but she never 
panicked and was able to ward off the attacks 
by thinking about treatment and handling a 
mascot (a piece of candy) which she kept 
throughout the treatment. The last time she 
was seen, she felt cheerful, confident about 
going out on her own, and practically cured of 
“unreasonable fears.” 

Three days after the last interview, the pa- 
tient died following one of her sudden black- 
outs when travelling with friends. The cause 
of her death was attributed to left ventricular 
failure; postmortem examination demonstrated 
heart disease. 


Case 2 


Two extensive interviews with this patient 
indicated that his anxiety was not associated 
with any specific stimulus but was more gen- 
eralised and might depend on a variety of 
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factors. For this reason, a modified technique 
was adopted, aimed at enabling S, by a process 
of conditioning, to cope efficiently with his 
anxiety no matter where it arose. First, a 
course of systematic desensitisation was given, 
based on Wolpe’s (9) relaxation techniques. 
This form of treatment failed, since the patient 
reported that he could not evoke any increased 
anxiety by thinking about stressful situations. 
He remained tense and anxious throughout 
the sessions, but reported no changes in feel- 
ing; GSRs similarly indicated no disturbances. 

The conditioning of anxiety-relief responses 
was then attempted by means of a technique 
similar to that described by Wolpe (9). An 
inductorium with a maximum inflow of 6 volts 
was applied to two fingers; the strength of the 
current was controlled by turning a knob. The 
patient was told to say aloud “calm yourself” 
when the shock became unbearable. As soon 
as the patient said this, Æ switched off the 
current. Wolpe has reported that many of his 
patients experienced relief from anxiety in dis- 
turbing situations when they used these words 
associated with the release from shock-induced 
tension. 

Since it appeared that S was most likely to 
manifest his symptoms in a crowded cinema, 
it was decided to initiate a procedure similar 
to that used with the first patient. It was 
planned to visit the local cinema at a time 
when it is usually relatively empty and then 
gradually delay visits, going later and later 
each day until eventually the visit would be 
made at a time when the cinema is commonly 
quite full. The effect of boredom was also 
taken into account since the patient saw at 
least part of the same picture each day. During 
the first week, the patient received 5 or 6 elec. 
trical conditioning trials immediately before 
visiting the cinema with Æ. After some thirty 
to forty minutes, both S and Æ returned to the 
hospital, where 5 or 6 further conditioning 
trials were given. 

After seven days of this treatment, there was 
no change in 5s behavior. During most of the 


visits to the cinema, he reported feelings of 
tension and an D i 


proved by the treatment. 
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In every respect, the patient’s reactions to 
the treatment differed from those manifested 
by Case 1. Whereas Case 1 reacted very 
quickly to the treatment and a good rapport 
was easily established, Case 2 remained aloof 
and detached. 

At this stage, Case 2 was reconsidered. A 
plausible hypothesis was derived from Franks’s 
study (2) concerning the conditionability of 
extraverts and introverts and from Eysenck’s 
theory of anxiety and hysteria (1). From these 
ideas, it seemed to follow that Case 1 would 
condition better and extinguish less rapidly 
than Case 2. Similarly, both cases should be 
more neurotic than average, and Case 1 should 
be more introverted than Case 2. These infer- 
ences were supported by the patients’ scores 
on Franks’s eyeblink conditionability measure 
and on the Maudsley Personality Inventory, 
On the theory? that inhibitory drugs decrease 
conditionability and heighten satiation effects, 
Case 2 was given 10 milligrams of dexedrine 
(excitant) for four days. As soon as the effects 
of the drug became apparent, he was submitted 
each day to the same treatment procedure as 
during the previous week, 

Throughout this second stage of treatment, 
the patient felt cheerful and relaxed and en- 
joyed going to the cinema. At no time did he 
display any symptoms. On the second day, he 
was left alone in the cinema. On the last day, 
he managed to stay alone ina crowded cinema 
without any difficulty. During the followin 
weekend he went home but took 10 milligrams 
of dexedrine each day. He traveled alone on 
buses, rode his bicycle, and went to a cinema 
with his wife. He reported no feelings of dis- 
turbance. On his return from the weekend, the 
dosage of dexedrine was reduced by 2.5 milli- 
grams a day with an equivalent amount of 
Placebo substituted without the patient’s 
knowledge. The conditioning procedure and 
the visits to the cinema on his own continued 
daily after the administration of the drug and 
the placebo. On the fourth day, he received 10 


milligrams of placebo only. During this stage 


* There is some evidence to support this theoretical 
Position. Unpublished studies at Maudsley Hospital on 
amobarbital sodium and dexamphetamine sulfate by 
Franks and Trouton, and on arecoline and methyl 
atropine by Franks, Laverly, and Trouton have yielded 
results consistent with the notions applied therapeu- 
tically in the present study. 
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of treatment, he gradually became tense and 
anxious and reported feeling slightly depressed. 
After each treatment session, however, he 
said that he felt better temporarily and at no 
time did he feel tense and anxious in the 
cinema. 

_ During the next weekend at home, he was 
Siven one placebo tablet for the Saturday and 
none for the Sunday. He coped with traveling 
and riding his bicycle, but felt somewhat tense 
and anxious. He maintained that the cause of 
these symptoms was the fear that the treat- 
ment might fail and that he might have to 
Stay in the hospital. He expressed a strong 
desire to leave, saying that he had not felt this 
desire before the treatment started. He also 
claimed that his present anxiety and tension 
had a “different quality since he could find a 
reason for these feelings.” 

The first day after his return from this week- 
end, he was given a final test. He had to travel 
on a bus some three miles away from the hos- 
pital, return, and then visit, unaccompanied, 
another very crowded cinema. He reported 
that before the bus ride he felt tense and 
worried, but on his return, he had recovered, 
regained his confidence, and felt cheerful. He 
also enjoyed the cinema and reported that he 
“never felt better.” 

The rationale of the treatment was explained 
to him and every step of the treatment (except 
the substitution of placebo) was discussed with 
him. After the final test he was discharged. 


Follow-up. It was planned to see the patient 
once every two weeks. He kept his first appoint- 
ment only, reporting considerable improvement 
in travelling and in visiting crowded places. He 
had also gained confidence in himself and in 
relation to strangers and his superiors. He still 
occasionally got his “queer feelings” of worry, 
Strain, and fear of fainting, but these were very 
mild and not very disturbing. The only diff- 
culty he experienced was when riding his 
bicycle, but he continued to cycle to and from 
work. Two weeks later he sent a letter, saying 
that he could not keep the appointment since 
he was in financial difficulties. Apart from 
these difficulties, he was managing very well 
and had mastered his fear of cycling com- 
pletely. He could cope easily with any uncom- 
fortable feelings he experienced. 

Three months later he sent another letter, 
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informing Æ that he was progressing very well. 
He had taken his children to the coast for a 
holiday, cycled to and from work without any 
difficulty, and visited cinemas regularly. 


DISCUSSION 


This paper attempts to demonstrate the ap- 
plication of some aspects of learning and the 
personality theory to the symptomatic treat- 
ment of psychiatric patients. At present, the 
experimental findings are not completely con- 
sistent, and the theories themselves are in early 
stages of development. The available findings, 
however, suggest rational experimental tech- 
niques for symptomatic treatment. 

It cannot be strongly maintained that a 
mere stay in the hospital could account for the 
recovery from the symptoms, since both pa- 
tients stayed in hospital well over a month 
prior to treatment, and according to their 
verbal account and the observation of their 
behavior in stressful situations, there was no 
indication of any improvement. In the present 
state of knowledge, however, it cannot be 
argued with any degree of assurance that the 
improvement of these patients’ long-standing 
phobic symptoms was due to the treatment 
given. Even if there was sufficient evidence to 
maintain that the treatment was responsible 
for the improvement, so many relevant factors 
were uncontrolled that one would not know 
which aspect of the treatment was relevant and 
which irrelevant. These relationships can only 
be established by more extensive and well- 
controlled research. 

Moreover, the usual objections made against 
symptomatic treatment on a priori grounds 
lack force. It seems plausible to argue, as 
Jones (7) has, that “Much evidence points to 
the fact that neuroticism is largely a constitu- 
tional defect for which no effective radical 
therapy is yet available. The individual of 
neurotic constitution in certain environmental 
circumstances develops certain symptoms. The 
rational therapeutic approach is then to treat 
the symptoms and to modify the environment 
so as to avoid their recrudescence or the de- 
velopment of fresh symptoms. The more 
specific the treatment, the more likely may be 
its success.” 

Several implications of theoretical and 
practical interest emerge from this study: (a) 
Since both cases manifested blackouts of an 
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unknown nature, particularly Case 1, the 
phobic symptoms presented were not certainly 
known to be functional in origin. The treat- 
ment, however, was oriented towards a func- 
tional analysis of the disorder. (b) It would be 
desirable to know whether, following a course 
of treatment like that described, the neurotic 
reactions are eliminated or merely overshad- 
owed by a stronger normal reaction. Although 
Wolpe (8) provides some evidence that 
neurotic reactions in cats can be eliminated, 
the question must remain open on the present 
evidence. (c) Although on clinical grounds 
Case 1 was regarded as an hysterical person- 
ality and Case 2 as a dysthymic, objective 
psychological tests indicated the reverse. It 
seems likely, moreover, that the etiology of 
the phobic states must have been quite differ- 
ent in the two cases. One would like to know 
how two patients showing such a considerable 
difference in conditionability came to develop 
more or less similar patterns of abnormal be- 
havior. (d) The results obtained during the 
course of treatment are consistent with 
Eysenck’s theory (1) concerning the dynamics 
of anxiety and hysteria. They also indicate, in 
accordance with Eysenck’s view, that for the 
purpose of treatment, adequate regard must 
be paid to the importance of certain individual 
differences as related to the disorders presented. 
(e) The use of drugs may facilitate and speed 
up successful treatment of this type, and this 
theory of individual differences may give a 
rational basis for the selection of drugs. 


SUMMARY 


The treatment of two phobic patients has 
been described. Case 1 manifested an excessive 
fear of going out on her own; Case 2 displayed 
disabling symptoms in the form of an excessive 
fear of going into enclosed and crowded places. 
Both cases had blackouts of unknown nature. 

The treatment program for both cases was 
mainly based on the principle of primary 
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stimulus generalisation. Owing to the more 
“diffuse” nature of the anxiety displayed by 
Case 2, an additional simple conditioning tech- 
nique was employed. 

Case 1 responded to the treatment immedi- 
ately and an improvement in her general be- 
havior was observed. The effects of treatment 
persisted for nearly five months, when the pa- 
tient died of left ventricular failure. 

Case 2 failed initially to respond to treat- 
ment. An attempt was made to account for 
this failure in terms of Eysenck’s theory of 
anxiety and hysteria. From the theory, infer- 
ences were made with respect to the patient’s 
behavior on objective psychological tests; the 
results were consistent with the inferences. The 
treatment was modified according to the theory 
and the patient responded as expected. Four 
months after discharge from the hospital, the 


patient seemed to be managing well and to feel 
much improved. 
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CRITIQUE AND NOTES 


PERCEPTION OF DISTURBING AND NEUTRAL WORDS THROUGH THE 
AUTOKINETIC WORD TECHNIQUE": ° 


SARNOFF A. MEDNICK, ALAN HARWOOD’, anp JACK WERTHEIM 
Harvard University 


mE Autokinetic Word Technique (AWT) 

A presents a situation in which the subject 
(S) is placed in a completely darkened room and 
asked to read and report words and sentences being 
written by an apparently moving but actually 
motionless point of light. A previous study (1) 
indicates that Ss do indeed report the perception 
of words and sentences. The use of the term 

Perception” is perhaps debatable, but the authors 
feel Justified in its use by the similarity of present 
operations to those involved in the study of such 
illusions as the Necker cube. 

One of the important uses to which the AWT 
may appropriately be put is the investigation of 
Nonautochthonous perceptual variables. In this 
study, it was hypothesized that in the AWT situa- 
tion “disturbing” words would be perceived with 
greater latency and less frequency than “neutral” 
words. 


METHOD 


A word association test (WAT) of 40 words was 
administered to 12 male and 12 female paid volunteer 
college students. All of the words used were drawn 
from the recent Minnesota standardization of the 
Kent-Rosanoff Word Association Test on a college 
population (2). They all have a Thorndike-Lorge AA 
rating (3), indicating that they are commonly used in 
the English language. On the basis of probability of 
response (2), five disturbing and five neutral words 
were picked for each S from his WAT results. In cases 
where, due to ties, a decision could not be made solely 
on the basis of probability, the latency of the response 
served as the deciding factor. The median probability 
and latency of responses to the disturbing words were 
001 and 2.10 sec., respectively; the median probability 
and latency of responses to the neutral words were .622 
and 1.30 sec., respectively. 

The Ss were then introduced to the AWT situation 
and instructed that we were going to write sentences 
with the point of light. (These Ss had had two hours 
of previous experience in the AWT situation in another 
study.) Before each sentence was written they were 
informed that they would be given one of the words in 


1'This research was supported by the Laboratory 
of Social Relations, Harvard University, and the 
Social Science Research Council. 

2 Presented to the 1956 meeting of the American 
Psychological Association. 

3 Mr, Harwood was the recipient of an SSRC Under- 
graduate Research Stipend, 1956, for work on this 
Study. 


the forthcoming sentence. This word might appear at 
the beginning, middle, or end of the sentence. The S 
was to inform E when he saw this word. 

Ten trials were then given in which the stationary 
pinpoint of light was turned on for 90 seconds along 
with a cheap electric motor that produced a convincing, 
variable noise. On five of the trials, neutral words from 
the WAT were the suggested words. Disturbing words 
from the WAT were the suggested words on the other 
five trials. Whenever S reported seeing the suggested 
word during the 90-second period, his response latency 
was noted. When the light was turned off, S was asked 
to report the complete sentence that had apparently 
been written by the point of light. E was present in 
the blacked-out room with S, who was seated approxi- 
mately nine feet from the point of light. 


RESULTS 


The two major measures of this study were 
(a) whether or not S reported perceiving a sug- 
gested word, (b) the latency of his reported per- 
ception. Of the 10 suggested words, Ss reported 
failing to see a mean of 3.73 words. Ss failed to see 
a mean of 2.09 of the five disturbing words and a 
mean of 1.64 of the five neutral words. This differ- 
ence, evaluated by the paired replicates test (4), 
was significant at the .01 level. Thus, S failed to 
report seeing more of the disturbing than neutral 
words. 

The mean latency of report of perception of the 
suggested words was 58.6 seconds. The perceived 
disturbing words had a mean latency of 64.4 
seconds. The perceived neutral words had a mean 
latency of 52.7 seconds. The difference between 
these values was significant at the .01 level by 
the paired replicates test. Postponement of the 
report of perception of the disturbing word could 
be the result either of a simple unfilled delay 
of response or of the perception of many words 
preceding the disturbing word. The mean number 
of words preceding the suggested word was 1.02 
words. The disturbing words were preceded by a 
mean of 1.08 words; the neutral words were pre- 
ceded by a mean of .96 words, a difference that is 
not significant. The delay of report of perception 
of disturbing words cannot therefore be attributed 
to the perception of other intervening words. 

The mean number of words reported in the sen- 
= built around the disturbing and neutral 

» Evaluate by the paired replicates test 
was not significant. ý 
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SUMMARY AND CONCLUSIONS 


Disturbing and neutral words were selected 
individually for Ss by means of a word association 
test. It was then suggested to these Ss that an 
apparently moving but actually motionless point 
of light in a completely darkened room (autokinetic 
effect) would write a sentence containing a given 
word. The given word was either a disturbing or 
neutral word. 

The Ss reported perceiving fewer of the disturb- 
ing words. Those disturbing words that were per- 
ceived had a greater latency than perceived 
neutral words. 
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LEARNING THEORY AND “OPPOSITE SPEECH” 


ARTHUR W. STAATS 
Arizona State College at Tempe 


AFFAL, Lenkoski, and Ameen (4), have re- 
j e reported the case of a schizophrenic 
patient demonstrating what they call an “opposite 
speech” syndrome. The authors state that the 
syndrome is basically a reversal in certain language 
usages, especially in the use of “yes” and “no.” 
Fceling that the speech reversal is an expression of 
repressed hostility, they suggest that opposite 
speech copes with the hostile impulses by dis- 
rupting communication and rejecting other people 
or by allowing the “verbalization of ideas which the 
patient consciously rejects” (4, p. 412). 

Laffal et al. conclude that learning theory has 
little to offer in this instance or others of patho- 
logical speech. “. .. [T]he so-called pragmatic and 
interpersonal functions of language behavior have 
been largely neglected by learning theorists in 
favor of the meaning or semantic aspects” (4, 
p. 412). 

However, a number of recent experimental 
and theoretical studies (such as 1, 2, 3, 5, 6, 7, 9, 
10) have been very much concerned with similar 
problems of language behavior. Several points 
can thus be made from a learning theory approach 
which have relevance for “opposite speech.” 
Skinner, for example, generally defines verbal 
behavior as “behavior which is reinforced through 
the mediation of another organism” (7, p. 20). 
Instead of asking, as do Laffal et al., “what uses 
[needs] the opposite speech may serve” (4, p. 412), 
reinforcement theory is concerned with the condi- 
tions which increase the probability that the 
response will occur again in similar stimulus situa- 
tions or which strengthen the response. 


It has been shown experimentally (1, 3, 9, 10) 
that social reinforcement which follows verbal 
responses has the effect of increasing their fre- 
quency. These studies have used various social 
reinforcements. The question is thus raised regard- 
ing the reinforcements involved in opposite speech, 
One possibility may be taken from a discussion of 
generalized reinforcers by Skinner. 

The child who misbehaves “just to get atten- 

tion” is familiar. The attention of people is 

reinforcing because it is a necessary condition 
for other reinforcements from them....The 
attention of someone who is particularly likely 
to supply reinforcement—a parent, a teacher, 
or a loved one—is an especially good generalized 
reinforcer and sets up especially strong atten- 

tion-getting behavior (8, p. 78). 

It is suggested that attention, operating as a 
generalized reinforcer, could account in part for 
the origin and maintenance of opposite speech. 
The interview quoted by Laffal (4, p. 411) is a good 
demonstration of how the patient’s opposite 
speech could elicit attention from others. Other 
reinforcers would also be effective in strengthening 
reversed language. For example, the patient's 
reversals were said to “not extend to... thought 
or action. Thus, if asked if he wishes a cigaret, 
the patient may say, ‘No’ instead of ‘Yes, I do,’ 
but he accepts and smokes the cigaret” (4, p. 412): 
Giving the cigaret to the “No” response is a rein- 
forcement and would be expected to strengthen 
that response, i.e., raise the probability that the 
patient would again say “No” in the same situa- 
tion. Reinforcements such as these, without co? 
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comitant admonition, would be unlikely to occur 
in the normal person’s social environment. 

_ Certain implications are derivable from this 
interpretation. If the opposite speech is maintained 
by positive reinforcement, then lack of such rein- 
forcement should lead to extinction of such 
behavior. For example, withholding the cigaret 
should weaken the strength of opposite speech, 
and giving the cigaret to correct speech should 
strengthen that type of response. 

_ Another example of learning theory formula- 
tions applicable to opposite speech is offered by 
Dollard and Miller (2). They discuss repression in 
terms of reinforcement theory. Certain thoughts 
arouse anxiety. Cessation of thinking those 
thoughts reduces anxiety. Thus, stopping thinking 
about that topic becomes a well-learned response. 
The same analysis can be applied to speech. 
Dollard and Miller (2) give an example of a group 
of people who change the topic of a conversation 
because it arouses anxiety, and state, “people 
tend to learn to avoid unpleasant topics of con- 
versation” (2, p. 199). 

It could be said that the schizophrenic patient’s 
verbal behavior, when it is not confused by rever- 
sal, elicits an anxiety response in him, perhaps 
because of its typical content. Confused ways of 
speaking, and perhaps even of thinking, would 
therefore be anxiety reducing. This rationale could 
be extended to obsessive thinking in addition to 
other types of confused schizophrenic speech and 
thought. In addition, for the schizophrenic, com- 
munication with others which is understandable 
probably introduces touchy subjects which arouse 
anxiety. Reversed verbal behavior and other con- 
fused speech may reduce anxiety when it produces 
breakdown of the communication and cessation 
of the anxiety producing subject matter or of the 
bothersome conversation itself. 

Why the schizophrenic’s speech might elicit 
anxiety in him need not herein be elaborated, since 
the speech symptom is the relevant topic. It could be 
stated, however, that the unhappy life situation 
of an adult schizophrenic probably elicits thought 
and speech which are not positive secondary rein- 
forcers, but instead arouse anxiety. It is also prob- 
able that the lack of success of the schizophrenic’s 
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life behavior evokes verbal behavior from others 
which is anxiety producing for the schizophrenic. 
Verbal and nonverbal behavior of the schizophrenic 
which would avoid this anxiety would thus be 
well learned. 

In concluding, it should be stated that this note 
is not intended as a complete analysis of the op- 
posite speech of the schizophrenic patient. Perhaps 
it points out that hypotheses which apply to 
pathological language can be derived from a learn- 
ing theory approach. At any rate, it is suggested 
that learning theory has reached a state where it 
has something to offer clinical theory and practice. 
The complex activities to which contemporary 
learning theory addresses itself, and not without 
success, indicate that the approach can no longer 


be ignored. 
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THE INFLUENCE OF ETHNIC ATTITUDES ON REASONING ABOUT ETHNIC 
GROUPS! 


NORMAN M. PRENTICE? 
Harvard University 


N THE abundant literature on ethnic prejudice, 
I studies exploring the effects of tolerant and 
intolerant attitudes on reasoning about ethnic 
groups are rare. In one of the few relevant investi- 
gations, Thistlethwaite (5) demonstrated that 
prejudiced subjects biased their reasoning against 
ethnic groups. Moreover, an account by Allport (2) 
of unpublished research by S. B. Sells suggested 
that tolerant as well as intolerant subjects may 
bias their reasoning about ethnic groups in direc- 
tions resonant with their attitudes, a finding in 
opposition to the traditional characterization of 
tolerant persons as rational and objective in their 
appraisal of ethnic groups. The account was none- 
theless consistent with the evidence from related 
attitude areas that opposing social attitudes con- 
duce to distorted reasoning in directions congruent 
with these attitudes (4). The aim of the present 
experiment was to examine more rigorously the 
influence of ethnic attitudes on reasoning logically 
about ethnic groups. The prediction was made 
that tolerant individuals would show bias in Javor of 
elhnic groups while intolerant individuals would 
show bias against ethnic groups on a test of syl- 
logistic reasoning involving ethnic content. 


METHOD 
Test Measures 


1. Attitudes. Ethnic attitudes were measured by an 
18-item form of the California Ethnocentrism (E) 
Scale, composed of an equal number of statements 
disparaging Jews and Negroes.’ Authoritarian attitudes 
were appraised by a 12-item form of the California 
Fascism (F) Scale. 


1This report is based on portions of a doctoral 
dissertation completed at Harvard University in the 
Department of Social Relations in 1956. The author 
wishes to express particular indebtedness to his prin- 
cipal advisers: Drs. G. W. Allport, G. Lindzey, and 
R. W. White. Drs. N. Kogan and D. J. Levinson 
generously read this report and offered many valuable 
suggestions. 

2 Now at Judge Baker Guidance Center, 

* Items selected from Adorno et al. (1) were numbers: 
8, 22 (pp. 68-69); 13 (p. 70); 2, 31 (pp. 110-111); 37 
(p. 117); and 1-6 of Subscales A and B (p. 142). 

‘The form given was taken from the FERPT 
Battery used by Dr. D. J. Levinson at Harvard Uni- 
versity, Department of Social Relations, 1952. With 
one exception (“More than anything else, it is good 
hard work that makes life worthwhile”), the items are 
either identical or slightly rephrased from Adorno et al. 
(1): b-44 (p. 248); b-21, b-42, c-13, c-25, c-34, ¢-37 
(p. 255); a-9, e-4, f-26 (p. 256); h-18 (p. 257). 


2. Objective reasoning about ethnic groups. The ability 
to reason logically about ethnic groups was provided by 
performance on a syllogism test constructed by the 
author called, “The Woodbridge Reasoning Test.’’® 
This test consisted of 48 syllogisms randomly arranged 
which the subjects (Ss) rated for logical validity. The 
Ss were instructed to rate the conclusions to the syl- 
logisms as following logically or not necessarily follow- 
ing logically from the premises. Of the 48 syllogisms, 
24 were logically valid and 24 logically invalid by this 
definition. Each group of 24 contained 8 subgroups of 
three syllogisms. The three syllogisms in each of these 
groups were alike in validity, syllogistic mood, approxi- 
mate sentence length, and linguistic style, but unlike 
in content. The premises and conclusions of one syllo- 
gism were tolerant, those of a second were intolerant, 
while those of a third were neutral; that is, designed 
not to produce any special affective reaction. The con- 
tent of none of the affective syllogisms was formally 
identical to that in the E scale, though both measures 
dealt with Jews and Negroes exclusively. In contrast 
with the syllogisms, all of the E scale items were 
negative; i.e., critical of Jews and Negroes. Content 
for some affective syllogisms was adapted from Thistle- 
thwaite (5), while that for others was drawn from 
stereotypes of minority groups enumerated by Allport 
(2).° The following triad in which all the conclusions 
are logically invalid illustrates the construction: 


Tolerant 


If no one who is especially loud, noisy, and over- 
bearing is admitted to a country club; 

And if many Jews are admitted to a country club; 

Then: No Jews are especially loud, noisy, and over- 
bearing. 


Intolerant 


If no one who mixes especially well with other people 
is aloof from the community activities; e 
And if many Jews are aloof from the community 

activities; 
Then: No Jews mix especially well with other people- 


® This test, together with a table indicating syllo 
gism construction, has been deposited with the Amer 
can Documentation Institute. Order Document No- 529. 
from ADI Auxiliary Publication Project, Photodupli- 
cation Service, Library of Congress, Washington +e 
D. C., remitting $1.25 for 35 mm, microfilm or $1- 


part- 
= f or 
ment, University of Chicago, kindly assessed the © 
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ETHNIC ATTITUDES AND REASONING 


Neutral 


If no one who is especially interested in Egyptian 
art is concerned with Mexican art; 

And if many dealers are concerned with Mexican art; 

Then: No dealers are especially interested in Egyp- 
tian art. 


Approximately 35 minutes were allowed for com- 
pletion of the syllogism test. 

- An error score formula was derived on the assump- 
tion that the influence of ethnic attitudes would lead 
to reasoning errors in predictable directions on syllo- 
gisms with particular ethnic content. Thus, tolerant 
individuals were expected to have more errors on 
reasoning items that were valid but contained intol- 
erant content and on reasoning items that were invalid 
but contained tolerant content. These same individuals 
were expected to have fewer errors on reasoning items 
that were valid but contained tolerant content and on 
reasoning items that were invalid but contained intol- 
erant content. The converse was predicted for intolerant 
individuals. Thus errors in judgment of validity were 
expected to be systematically rather than randomly 
distributed across the reasoning items with affective 
content. The error score used for the analysis was as 
follows: 


Error Score = [(AV — PV) + (PI — AD] 
where 


AV = Anti-Valid: number of errors on valid 
syllogisms with intolerant content. 


PV = Pro-Valid: number of errors on valid 
syllogisms with tolerant content. 

PI = Pro-Invalid: number of errors on invalid 
syllogisms with tolerant content. 

AI = Anti-Invalid: number of errors on invalid 


syllogisms with intolerant content. 


The use of a difference score in the formula controlled 
for the level of reasoning ability. Neutral items were 
included in the syllogism test to provide variety of 
content and for other purposes, but scores on these 
items were not used in the analysis herein. 


Subjects 

The tests were taken anonymously by 128 white 
Christian male and female freshmen enrolled at the 
Boston University School of General Education in the 
early Fall of 1955 who volunteered for a “study of 
social attitudes.”? The Woodbridge Reasoning Test, E 
and F Scales, were administered in that order. None of 
the Ss had had courses in psychology or formal logic. 


RESULTS 


Product-moment correlations between the E 
and F scores and the mean error scores on the 
Woodridge Reasoning Test were —.22 and —.30. 


7 The cooperation of Dr. G. N. Eddy and Mr. R. 
Smart, Social Science Department, Boston University 
College of General Education, in making Ss available 
for this study and prior pilot studies, is gratefully 
acknowledged. 
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TABLE 1 


Comparisons OF MEAN Error Scores WITH 
HYPOTHETICAL ZERO MEAN SCORES ON 
WOODBRIDGE REASONING TEST FOR 
Inpivipvats Hicu anp Low 
on E anD F SCALES 


N x t $ 
Low E 31 1.74 4.25 <.001 
High E 32 63 1.27 Reversal 
Low F 30 1.5% 3.75 <.001 
High F 29 —.41 — .88 <.19 


_ Note.—Values are for one-tailed ¢ test [following Lord as 
cited by Mosteller and Bush (3). 


These values are significant beyond the .02 and 
‘01 levels (one-tailed test), respectively. The 
correlation coefficients are negative since stronger 
intolerant attitudes (high E and F scores) are 
associated with less reasoning bias in the tolerant 
direction. 

A subsequent analysis compared the mean error 
scores on the Woodbridge Reasoning Test made by 
individuals in the upper and lower quarters on the 
E and F scales, with hypothetical mean error scores 
of zero (unbiased reasoning). As the error score 
formula is applied, positive error means will be 
associated with the lower quarters of the E and F 
scales and negative error means with the upper 
quarters if respondents are biasing their reasoning 
in a direction congruent with their attitudes. 
Table 1 presents the results from this analysis. As 
indicated, in both the lower E and F quarters, 
significant bias operating in favor of ethnic groups 
was demonstrated (<.001 in both instances). 
However, in the high E and F quarters, significant 
bias operating against ethnic groups was absent, 
although the upper quarter F mean tended in the 
predicted direction. The absence of intolerant bias 
can be presumptively related to the composition 
of the sample. While the neutral item mean for 
both the attitude scales is 4.00, the item mean for 
the E scale was 2.72. Thus, as a whole, the sample 
with respect to the E scale was composed pre- 
dominantly of tolerant individuals. The sample 
with respect to the F scale was somewhat more 
normally distributed with an item mean of 3.80. 
In this connection, an important distinction be- 
tween relative and absolute position along these 
attitude continua should be made. In the present 
study, for example, the upper quarter E group, 
although extremely intolerant relatively, tended 
to be slightly tolerant in terms of the total possible 
range of ethnic attitudes measured by the scale. 
Indeed, as indicated in Table 1, the High E group 
showed a tendency to bias in the tolerant direction. 
Clearly, in considering extremes, it is important 
to be concerned not only with the samples at hand 
but with the theoretical universe as well. 
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DISCUSSION 


In the present study, tolerant individuals biased 
their reasoning about ethnic groups, while 
Thistlethwaite (5) using a similar method in 1948 
reported intolerant bias alone. The present experi- 
ment conducted in 1955 in a social climate some- 
what altered during the intervening years, used a 
completely Northern sample lacking highly prej- 
udiced Ss. Thistlethwaite used Southern samples 
which presumably contained many highly prej- 
udiced Ss. He compared the direction of reasoning 
bias in these with Northern samples which may 
have contained both intolerant and tolerant 
respondents. As a consequence, the tolerant bias he 
predicted for his Northern samples may have 
been attenuated. However, as Thistlethwaite 
selected his comparison groups on the basis of 
region rather than on standard attitude scales, 
more exact comparisons of these related studies 
cannot be made. Nevertheless, the findings of 
both studies taken together support the general 
assumption that individuals with opposing social 
attitudes distort their reasoning in directions 
congruent with these attitudes. Moreover, the 
findings from the present study challenge the 
conventional emphasis on veridicality in tolerant 
individuals. Apparently, tolerance is not always 
associated with superior reality oriented reasoning. 


SUMMARY 


This study was undertaken to investigate fur- 
ther the relationships between ethnic attitudes 


Norman M. 


PRENTICE 


and objective reasoning about ethnic groups. lt 
was argued that tolerant individuals would bias 
their reasoning in favor while intolerant individuals 
would bias thcir reasoning against ethnic groups. 
The findings revealed that tolerant individuals 
biased their reasoning significantly in favor of 
ethnic groups. However, intolerant individuals 
did not bias their reasoning significantly against 
ethnic groups. The lack of bias against ethnic 
groups was presumptively related to the absence of 
strongly prejudiced respondents in the sample. 
Biased reasoning on the part of unprejudiced 
individuals challenged the traditional assumption 
that tolerance is invariably accompanied by supe- 
rior reality directed thinking. 
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ERRATA 


In the article “Perceptual Organization and 
Intelligence: A Further Study,” by Clinton 
De Soto and H. Leibowitz, which appeared in 
the November, 1956 issue of this journal (J. 
abnorm. soc. Psychol., 1956, 58, 334-337), some 
incorrectly calculated probabilities of Type II 
errors were given. Recalculated probabilities 
are given in Table 1. These are probabilities of 
Type II errors for 2-tail ¢ tests of the differ- 
ence between mean vocabulary scores of high 
and low perceivers in case of certain hypo- 
thetical true differences in various experi- 
mental conditions and groupings, as explained 
in the original article. The probabilities in 
Table 1 are substantially greater than those 
given earlier, but low enough that a complete 
absence of significant s would be surprising 
if a large true difference existed. All /’s ob- 
tained were nonsignificant, and the average 
difference was in fact in the wrong direction. 


In the article by S. J. Korchin and S. 
Levine, “Anxiety and Verbal Learning,” this 
Journal, 1957, 54, 234-240, it was erroneously 
Stated that Dr. Korchin is now at the Research 
Division, Columbus State Institute of Psy- 


It may be noted that 1-tail tests might have 
been used, since the direction of the difference 
was hypothesized. One-tail / tests would have 
had lower probabilities of Type II errors than 
2-tail ¢ tests for the hypothetical true differ- 
ences listed, and would also have yielded uni- 
formly nonsignificant results. 


TABLE 1 
Condition Difference ag ge 
0.01 sec. 3.76 -65 
0.01 sec. 5.20 +42 
0.10 sec. 3.99 -56 
0.10 sec. 5.53 30 
1.00 sec. 3.39 57 
1.00 sec. 4.70 .32 
Above combined 1.87 -63 
Above combined 2:53 -40 
30 sec. 1.87 72 
30 sec. 2.53 +05 


chiatry, Ohio State University. This statement 
applies to Dr. Levine. Dr. Korchin remains 
with the Institute for Psychosomatic and 
Psychiatric Research and Training, Michael 
Reese Hospital. 
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CONSCIOUSNESS AS A REGULATORY FIELD: 
A THEORY OF PSYCHOTHERAPY 


REX MADISON COLLIER 
VA Hospital, Jefferson Barracks, Missouri 


WO PREVIOUS papers have outlined and 
partially developed a regulatory theory 

] of consciousness. The first (3) was a 
brief initial statement which applied only to 
nonstress conditions; the second (4) developed 
the theory in terms of stress-defense dynamics 
and made applications to basic problems in 
Psychopathology. The present paper proposes 
to interpret selected aspects of the process of 
psychotherapy in terms of these concepts. 

Any adequate review of the literature on 
psychotherapy (e.g., Ellis [8]) reveals the grow- 
Ing tendency to eclecticism. In any field, ec- 
lecticism implies the need for a set of basic 
concepts that bring system, relatedness, and 
even unity to the common factors empirically 
selected from a variety of sources. Among the 
few current attempts to integrate the varied 
Phenomena of psychotherapy in terms of a 
Systematic set of concepts, the most notable 
have originated from Hullian learning theory 
(7, 12, 14, 15, 16). The chief difference between 
this type of theory and the kind here being 
developed is probably one of emphasis. Hullian 
theory emphasizes peripheral factors for the 
sake of accessibility and objectivity. The pres- 
ent theory places more emphasis on central 
factors with utilization of the biological con- 
cept of regulation. Some reduction in objec- 
tivity is risked for the purpose of attempting 
an organized approach which gives a more 
appropriate place to the unsuccessfully rejected 
concept of consciousness. 

Any theory of how psychotherapy functions 
must be based on concepts of integration or 
the broader concept of regulation. The prob- 
lem of psychotherapy may be stated as that of 
discovering and managing the conditions which 
facilitate the re-establishment of a more ade- 
quate degree of self-regulation. It is assumed 
that no definition or approach to psycho- 
therapy is complete or sufficiently basic with- 
ut including the concept of self-regulation in 
Some form. The assumptions (a) that con- 
Sciousness has a functional role in behavior and 
b) that the concepts of field and regulation are 
*ssential to this function not only give new 


meaning to the neglected but inescapable con- 
cept of consciousness but also lay the basis for 
developing and defining an interrelated group 
of dynamic concepts that seem to be basic in 
behavior. 

While some writers (e.g., John Dewey [6]) 
have denied that consciousness can be success- 
fully defined, for the purposes of this theory 
the following definition is proposed: Conscious- 
ness is an awareness field of biopsychological 
origin in which processes which may eventuate 
in action have increased chances of interaction 
and mutual modification. Thus, through such 
an awareness field, however simple and rudi- 
mentary or complex and comprehensive, the 
organism is provided with the means for a 
more flexible adaptation. In this sense, there- 
fore, consciousness assumes a role which gives 
it great survival value in the evolutionary 
process, Since the problem of psychotherapy is 
to a large extent one of reorganization and 
readjustment, it is no mere coincidence that 
Progress in reorganization is closely related to 
the degree to which the individual can allow 
himself to become aware of significant issues. 

The regulatory effectiveness of the conscious 
field is dependent upon its magnitude which 
varies with the degree to which processes rele- 
vant to the current issues may be accessible to 
the field. A reduction of this kind of magnitude 
through blocking, resistance, or repression re- 
Sults in some degree of coincident reduction 
in the flexibility and regulatory adequacy of 
the conscious field. It is assumed that the more 
permeable the conscious field boundaries are to 
Processes relevant to a current issue, the 
broader and more inclusive is the regulatory 
frame of reference. 

The conditions producing variations in the 
regulatory effectiveness of the conscious field 
have been described in terms of five dynami- 
cally interrelated conceptualized variables, 
These are as follows: (a) stress, (b) defense 
system with primary and secondary compo- 
nents, (c) stress-tolerance thresholds, (d) ego 
strength, and (e) boundary permeability of the 
conscious field. These concepts, defined and 
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utilized in detail in a previous paper (4) lead 
to a central concept of stress-defense dynamics. 
In brief schematic form, the problem of psy- 
chotherapy in terms of the regulatory theory 
of consciousness and these related variables 
may be stated as follows: 

1. The psychopathology which the individ- 
ual (patient! or client) seeks to reduce is a 
product of the individual’s system of defense 
interacting with experienced stress. : 

2. As a result of the stress-defense interac- 
tion, the regulatory field of consciousness be- 
comes restricted to a degree that cripples its 
regulatory effectiveness. This allows the re- 
jected segments of experience and of the self 
to produce biased and poorly managed im- 
pulses and behavior which often dominate 
responses to the further discomfiture of both 
the individual and his associates. 

3. Assuming that the pathological processes 
are reversible to some degree, the first or pri- 
mary essential in psychotherapy is the provi- 
sion of a relationship designed to reduce the 
individual’s need for his defensive system and 
coincidentally to increase his threshold of toler- 
ance for stress. Therefore, some degree of 
dependence of the individual on the therapist 
for some period of time is necessary in most 
instances. 

4. Communication between the individual 
and the therapist facilitates intrapersonal com- 
munication, which in turn increases chances 
for greater permeability of the restricting de- 
fensive boundaries to the conscious field. 

5. Increases in ego strength—the ability to 
accept into consciousness potentially stressful 
material and to deal with it constructively— 
occur as the individual experiences acceptance 
by another who is significant to him. Thus, the 
individual learns to accept himself and thereby 
lays the basis for a more adequate degree of 
independence and self-regulation. 

6. The rate at which rejected segments of 
experience and self can be returned to the 
regulatory conscious field depends upon all of 
these interrelated factors taken together, but 


1The term “patient” is used here with special 
reservations. The traditional connotation that the 
patient is the passive recipient of things done to him 
and of orders carried out for him must be rejected. For 
psychotherapy to be effective, the patient must be an 
active participant in his own treatment. It is with 
this connotation that the term is used in this discussion. 
For discussions of this and related issues reference is 
made to Lindner (11) and Shakow (13). 
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initially the most important factor is the 
therapeutic relationship. 

7. Increases in ego strength are of great 
importance, for they are associated with the 
following changes: (a) higher stress-tolerance 
thresholds, (b) reduction of need for the de- 
fense system, (c) increased permeability of the 
boundaries to the conscious field, and (d) in- 
creased self understanding or insight. Taken 
together, these results tend toward greater 
emotional and intellectual freedoms to move 
in directions more representative of the total 
personality. 

It is assumed that there is no final or essen- 
tial conflict between this theoretical approach 
and the established facts of learning. Some 
learning theories will emphasize particulars and 
their constancies, e.g., S-R, cues, verbal sym- 
bols, etc., while here the emphasis is on central 
regulation, field interaction, and the mutual 
modifiability of the interacting processes. Some 
may worry about the subjectivity aspect of the 
concept of consciousness, but it is important to 
note that even some behavioristic theories have 
much need for the intervening variable. The 
conscious field may be thought of, if one wishes, 
as a field of interacting intervening variables 
with regulatory significance for both the S and 
the R. This will emphasize, however, the “O” 
of Woodworth’s S-O-R formula (17) and re- 
duce the emphasis on S-R aspects. Often the 
essential difference between theories is the 
place of emphasis as one plays the scientific 
game of attempting to make a subject matter 
more intelligible, 

During any attempt to escape the current 
eclecticism in psychotherapy and to arrive at a 
ea eee a t of ny 

, ersome, phenomena will be 
encountered. A selected group of these to which 
the regulatory theory of consciousness may be 
applied is as follows: communication and psy- 
chotherapy, free association, insight, effects of 


“uncovering” and interpretation, abreaction, 
the relationship aspect of psychotherapy, and 
group relationship factors. While this is not an 
exhaustive list, it serves to illustrate how an 
ordering of the concepts may occur in terms of 


the regulatory theory of consciousness. 


COMMUNICATION AND THERAPY 


If the function of the conscious field is 
regulatory along with maintenance of flexi 
bility of adaptation to immediate issues, the? 
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the adequacy of this regulation is dependent 
upon what is available to the field. Cultural 
predispositions require, for the sake of clarity, 
a repetition of the point that this concept of 
consciousness is far broader than a synonym 
for reasoning, intellectualizing, or problem 
solving as these terms have been traditionally 
used. Western culture predisposes the individ- 
ual to assume that it is “bad taste” to reveal 
strong feelings. The child who experiences 
Strong emotion arouses anxiety in his elders 
and soon learns to fear and reject his own 
feelings. “Control” is thus acquired through an 
habituated exclusion of affect. The concept of 
consciousness that tends to be current is in 
part characterized by “flattened affect.” Thus, 
the group predisposes itself to a culturally 
based incipient schizophrenia. To whatever 
extent the individual has denied himself the 
Opportunity or privilege of experiencing his 
feelings, anxieties, dreads, elations, etc., to that 
extent he has become a more rigidly organized 
individual with fewer chances for these feelings 
to become flexibly related to other areas of his 
personality. A kind of intrapersonal communi- 
cation has become interrupted. The function of 
communication in the process of psychotherapy 
is to make available to the conscious field, proc- 
esses relevant to a given issue. Maximal ade- 
quacy of self-regulation is associated with 
maximal intrapersonal communication. Unac- 
ceptable experiences and feelings often become 
rejected and alienated from the main stream of 
awareness that tends to represent the self. 
From these rejected segments of experience, 
feeling, and self-evaluation, there are continued 
effects in the forms of anxiety, impulsive and 
compulsive desires, drives, irritations, and feel- 
ings of frustration that are poorly integrated 
with the main trends of personality. These re- 
jected segments of feeling and experience are 
as rigid and unchanging as is their degree of 
dissociation from the conscious field. Re-asso- 
ciation through improved intrapersonal com- 
munication is not only a goal of psychotherapy 
but is also recognized here as a goal of growth 
and increased maturity and hence should have 
an even wider application. One implied appli- 
Cation is that the individual should learn to 
accept and respect the affect aspect of experi- 
ence even though this may, at times, cause him 
Considerable anxiety. 

Since interpersonal communication tends to 
&cilitate intrapersonal communication, psy- 


chotherapy may be represented as a specialized 
and controlled means of facilitating intraper- 
sonal communication. Barriers between dis- 
sociated segments of the personality tend to 
become more permeable as one continues to 
talk about the issues relevant to the dissociat- 
ing processes. A simplified formula may be 
proposed as follows: What one can talk about 
indicates what he can begin to accept, and what 
one can accept indicates what he can begin to 
manage. Thus, a step is taken in the direction 
of a more adequate degree of self-regulation. 


‘FREE ASSOCIATION 


The fundamental rule in traditional psycho- 
analytic procedures is that the analysand 
should say or represent verbally all that comes 
to mind, regardless of its nature. It can be seen 
immediately that, in terms of the regulatory 
theory of consciousness, the analyst attempts 
to utilize a basic device for facilitating the re- 
introduction of segments of experience to the 
conscious field. Verbalization heightens aware- 
ness, and the process of talking with reduced 
restriction encourages exploration and dis- 
covery of significant neglected and rejected 
aspects of experience. There are increased 
chances that a rejected aspect of experience 
reintroduced to consciousness will enter into 
mutually modifying relations with other proc- 
esses currently available to the field. This can 
provide the discovered or uncovered aspect of 
experience with new context and a revised 
meaning. Hence, some increased degree of 
flexibility has been achieved. However, the 
invitation to a patient to use free association 
can be extremely threatening. It is an invita- 
tion in many instances to disturb, even to 
disrupt, a kind of equilibrium by which the 
individual has become accustomed to live. In- 
sistence on the observance of the “fundamental 
rule” often places the analyst and the analy- 
sand in conflict, since the defense system of the 
analysand is invariably activated, Tradition- 
ally, the defense mechanisms were viewed as 
more or less nuisance devices that prevented 
the analysis from progressing smoothly toward 
an early successful conclusion. The approach 
of “Off with his defenses” often has subtle 
similarities to the Red Queen’s command of 
“Off with their heads!” 

No therapy is adequate that does not re- 
spect the individual’s defense system as a basic 
aspect of his personality. Free association, be- 
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cause of its potencies, is both a useful and a 
hazardous technique. To invite the individual 
to free associate is to invite him to lay aside 
his defenses and accept back into awareness 
that which he has found habitually unaccept- 
able and stressful. In terms of stress-defense 
dynamics, free association may be either harm- 
ful or beneficial. It is hypothesized, in terms of 
the present theory, that free association is 
therapeutically successful under the following 
circumstances: (a) if the individual’s defenses 
operate well enough so that stressful material 
is fed back to the conscious field at a rate that 
does not exceed ego strength as here defined; 
(b) if the relationship with the therapist is 
sufficiently supportive so that there is some 
increase in ego strength; and (c) if the thera- 
pist’s interpretations do not release anxiety 
that exceeds the patient’s stress-tolerance 
threshold. Free association can be less than 
successful, or even harmful, if the patient’s 
secondary defenses are breached before there 
has occurred sufficient development of ego 
strength so that secondary defenses may be less 
necessary to the patient. If the stress of free 
association in conjunction with too penetrating 
interpretations robs the individual too quickly 
of his secondary defense components, he will 
revert to primary defense. The ensuing results 
depend upon the seriousness of the problem. 
The patient may in some of the less serious 
instances simply leave therapy with less ac- 
cessibility to future attempts, or, if he stays in 
therapy, he may show psychotic trends. In the 
more serious instances, a psychotic episode 
may occur. It is hypothesized that, were ade- 
quate records of psychotic breaks during 
analyses available, it would be found (a) that 
the patient initially had an inadequate second- 
ary defense elaboration and depended chiefly 
on primary defenses—a dynamic arrangement 
which was not clearly recognized by the thera- 
pist—or (b) that what secondary defenses he 
had were interpreted away before ego strength 
had improved sufficiently to tolerate the added 
stress. 

The couch and a reclining position 
usually thought of as related to the techni 
of free association. Placing the patient on a 
couch where he cannot face his therapist can 
be damaging if the initial ego strength is not 
high enough to accept the stress involved in 
such a nonsupportive relationship. For a pa- 
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tient with weak ego strength, which tends to 
go with predispositions to utilize primary de- 
fense, to be on a couch with an aloof analyst 
out of sight and to be requested to free associate 
about his problems is to take a long step in 
the direction of precipitating a psychotic 
episode. 

Free association is a powerful technique for 
facilitating the reacquisition of experiences, 
feelings, and meanings to the conscious regula- 
tory field, but the speed of reacquisition must 
be compatible with the balance between stress- 
tolerance thresholds, or thresholds of defense 
reaction, and current ego strength. The proc- 
ess traditionally referred to in psychoanalysis 
as “making the unconscious conscious” needs 
extensive review in terms of the stress-defense 
dynamics which are here being outlined. 


INSIGHT 
From what has thus far been written, it 
should be clear that the concept of insight can 
be useful, but it must be far broader in psy- 
chotherapy than the insight of nonemotional, 
intellectual, problem-solving behavior. Insight 
in psychotherapy is based on the acceptance 
into consciousness of hitherto neglected or re- 
jected aspects of experience under conditions of 
sufficient emotional permissiveness for these 
aspects, as well as other relevant aspects of 
experience, to interact so that new and more 
comprehensive meanings emerge. Insights are 
. . . K 
therefore, limited (a) if the boundaries to the 
conscious field are relatively impermeable to 
thoughts, feelings, and tendencies relevant to 
the problem, (b) if ego strength is low, and 
(c) if experienced stress is currently high 
enough to keep t} i h 
Pr a ale P the primary component of the 
c system active. The sud ic 
insights, described į ae oe nie 
seth Say cribed in the textbooks with mis- 
= ing frequency, are simply one of the rarer 
: ps of the broader principle of intraper- 
ona icati i i 
p ie Ea referred to earlier. To be 
i pen-minded” (the concept of the permeable 
a -j 
trier to the Conscious field) toward one’s ex7 
periences and feelings and to what these have 
at to one, is a basic objective not only P 
me ae but in the maintenance ° 
af the health for everyone. If this conditio” 
es conscious regulatory field is maintain? 
= oe may occur fairly frequently, i J 
a a them are likely to be dramatic with 7” 
Sard to one’s emotional life, since his adj" 


ea 


7— 


Ve 


ee 


Consciousness As REGULATORY FIELD 


ment problems are kept up-to-date and there 
are no significant areas of experience that re- 
main rejected or alienated. 

, While the facilitation of insightful reactions 
IS an important goal in psychotherapy, the 
more fundamental objective is to provide the 
individual an opportunity gradually to accept 
and to deal realistically with aspects of him- 
self and his experience that heretofore he has 
had to neglect or reject. The reaction of insight 
can be thought of asa criterion that such intra- 
Personal communication is occurring. How- 
ever, insights must be more than logical; they 
must also represent an acceptance of the emo- 
tional meaning of experience. The logical in- 
sight by itself may indicate that the more 
affective aspects of experience are still being 
rejected. Logical or intellectual insight tends 
to modify behavior chiefly under nonstress 
conditions. Where the problem involves threat 
and anxiety sufficient to activate defenses, the 
insight must be affectively toned and compre- 
hensive enough to indicate a real acceptance of 
the basic threat issue. The terms and processes 
of logical problem-solving are often less than 
adequate for behavior under stress. 

If the fundamental goal of psychotherapy is 
to re-establish a more adequate degree of self 
regulation, then the techniques used must 
facilitate through interpersonal communication 
an improved intrapersonal communication. 
The broader concept of insight as used here 
becomes one criterion for movement toward 


this goal. 


THE UNCOVERING OR INTERPRETIVE 
CONTINUUM 


When a therapist interprets or invites a 
patient to explore his experiences, the proce- 
dure is sometimes referred to as an “uncover- 
ing” technique. How a therapist’s response 
may be scaled according to the degree of un- 
covering represented has been outlined in 
previous studies (2). The concept of scaling the 
depth of interpretive responses of the therapist 
is of great importance in reference to the stress- 
defense dynamics. From this a clearer rationale 
seems to appear for determining the degree of 
depth of interpretive response of the therapist 
appropriate at any particular time for the pa- 
tient. The question may be asked whether the 
therapist should restrict himself to listening, to 
repeating significant words or phrases, to clar- 
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ifying or reflecting, or whether he should in- 
clude greater depth in some of his responses. 
If the objective of therapy is aiding the patient 
in achieving an improvement in self-regulatory 
ability, then it must be assumed that he needs 
to accept into the conscious field and to inte- 
grate constructively into action patterns some 
aspects of himself and of reality around him 
that heretofore he has rejected or neglected. 
The way in which he presents his problem at 
any given time reveals his capacity (ego 
strength) to deal with the problem. What he 
leaves out or attempts to avoid reveals, in part, 
his use of primary defense (not simply resist- 
ance). How he rationalizes, compensates, and 
utilizes various symptom patterns reveals not 
only his need for secondary defense but also 
his capacity to blunt the sharpness of stress 
factors so that he can still deal with reality to 
some extent (4). It often occurs in the begin- 
ning of a therapy relationship that the patient 
is quite restricted in his approach to the prob- 
lem and in what seems to him as relevant. The 
therapist’s technique in managing the relation- 
ship is a first step in reducing stress which 
often aids in some coincident increase in ego 
strength. Material relevant to the problem, 
but held at a superficial level of rejection, may 
still not be verbalized. Stress effects are often 
obvious, but, if stress-tolerance thresholds are 
not greatly exceeded, the patient may, even 
so, deal with the material integratively. Any 
interpretation that brings in material that the 
patient can neither accept nor deal with con- 
structively, that reactivates primary defenses, 
or that stimulates symptom formation may be 
said in general to exceed the therapeutically 
appropriate. For the therapist to insist on an 
interpretation which involves the risk that the 
patient may feel less accepted or less under- 
stood is to retard the therapeutic processes as 
defined here. The therapist may be as pene- 
tratingly interpretive as the current stress- 
defense dynamics predispose the individual to 
accept and to utilize such interpretations. This 
contention involves the assumption that the 
interpretation has some degree of validity that 
may be sifted out by continued exploration. 

A persistent hazard of the more penetrating 
interpretation is that it can rob the patient of 
the ego strength-building experience of self- 
discovery. If the purpose of therapy is to 
improve the quality and adequacy of self- 
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regulation, then there should be maximum 
opportunity for the patient to make his own 
discoveries and his own applications. The ap- 
propriate technique for the therapist often 
grows out of the question put to himself, “How 
can I set the situation so that the patient can 
recognize a feeling, relationship, meaning, etc., 
and retain the feeling that he discovered it for 
himself?” While this cannot always be done, it 
should occur sufficiently often, or the ther- 
apist is not providing an opportunity for the 
self-regulatory capacities of the individual to 
grow. This approach not only avoids over- 
dependence on the part of the patient, but it 
also represents an area in which psychotherapy 
and teaching should have greater overlap. 

To make a sharp distinction between sup- 
portive (often called “covering”) and explora- 
tory (or “uncovering”) therapy has become 
relatively frequent. In terms of the stress- 
defense dynamics which have been discussed, 
this kind of dichotomy is undesirable. To talk 
about one’s problems is to uncover threaten- 
ing aspects in some degree. How much revela- 
tion is appropriate depends on the relationship 
between therapist and patient, the degree of 
ego strength available to the patient, and the 
balance between primary and secondary seg- 
ments of the defense system. The supportive- 
uncovering dichotomy should be thought of as 
a depth continuum. Currently, the therapist 
must depend on his clinical judgment as to 
how exploratory he should be or how explora- 
tory he should allow his patient to be or at 
what level on the depth continuum it is appro- 
priate to work. Depth can be defined here in 
terms of the typical degrees of hesitation and 

resistance encountered in the social group in 

discussing issues of dynamic importance to 

them. Such a scale has yet to be devised, but 
it is experimentally feasible and could have a 
variety of uses. In any case, some degree of 
depth is represented by any discussion. There- 
fore, it is important to point out that no 
therapist can be supportive in any adequate or 
relevant manner until his patient has provided 
some exploration of his problems. To attempt 
to be only “supportive” in the traditional sense 
is often to leave the patient with a feeling that 
he is not well understood or even that he has 
been somewhat rejected. 


ABREACTION 


Abreacting a traumatic or stressful experi- 
ence has been recognized as having both 
therapeutic and nontherapeutic effects. Grin- 
ker and Spiegel (10) have said that abreac- 
tions under pentothal are not necessarily 
therapeutic, but may provide a beginning of 
useful insight. They have also said that abre- 
actions in a bar under the influence of alcohol 
are nontherapeutic. Dollard and Miller (7) pro- 
pose that the difference between therapeutic 
and nontherapeutic abreactions may be under- 
stood in terms of the concepts of a conditioned 
response theory of learning, e.g., reward and 
punishment, reinforcement and extinction, and 
generalization. How would stress-defense dy- 
namics conceptualize this issue? 

Abreaction, asa reliving of a past experience, 
provides an opportunity, under conditions of 
increased permeability of conscious boundaries, 
for a rejected segment of experience to be re- 
introduced to the conscious field and to be 
assimilated. By assimilated is meant (a) that 
the experience has an increased degree of 
acceptance and (b) that it is provided an op- 
portunity to enter into mutually modifying 
relations with other processes also available to 
the conscious field. That is to say, in the new 
context it now has changed meaning. Accept- 
ing the segment of experience which has been 
persistently alienated implies a reduction in 
defense operations at the time. This can occur 
in spite of the stressful nature that abreaction 
seems at times to have only if the patient has 
had an increase in ego strength. Such a result 
grows out of the degree of confidence he has in 
the therapist and the treatment procedures. It 
is also therapeutically possible for an abreac- 
tion to be beneficent apart from a therapist or 
institutions, provided the individual can accept 
the products and deal with them with at least 
some degree of reduced defensiveness. This can 
occur only when some current set of factors 
has provided an increase in ego strength as 
defined here. 

Abreaction can be nontherapeutic or eve? 
detrimental if the individual cannot accept the 
revived segment of experience. Revival may 
be brought about by means of drugs, free 2880” 
ciation, hypnosis, or other methods. Since th? 
stress generated will activate the defense SY* 
tem which has maintained the rejection ° 
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alienation, the personality may reconstitute 
itself along habitual lines. This may be thought 
of as simply a nontherapeutic result. The ex- 
perience may be detrimental if the situation in 
which abreaction occurs is now also rejected. 
The therapist, the institution, or the treatment 
procedure may be rejected and the patient is 
then less accessible than he was previously. 
The area of trauma has become enlarged. In 
general, to be therapeutically successful, abre- 
actions should occur only when there is a good 
relationship of confidence and mutual accept- 
ance between therapist and patient. How soon 
this occurs in therapy depends on the person- 
alities of the people involved; it may occur in 
the first interview or it may never occur. 


THE RELATIONSHIP ASPECT OF 
PSYCHOTHERAPY 


_ The importance of establishing and manag- 
Ing a relationship between therapist and pa- 
tient that may be utilized for therapeutic ends 
has long been recognized. This relationship, 
the subject of many treatises, is often described 
in language vague and mystical. It is hoped 
that a step in the direction of a more opera- 
tional approach is accomplished by invoking 
Stress-defense dynamics in this issue. When the 
therapist is accepting and relatively nonjudg- 
mental toward his patient in managing this 
relationship, he is (a) reducing a sense of 
stress and (b) reducing to some degree the 
patient’s need for defense. With less need for 
defense, there is some increase in the permea- 
bility of the barriers to the conscious field. 
Coincidentally, the patient perceives not only 
the therapist’s acceptance but his imperturba- 
bility or lack of surprise at the “terrible” 
things the patient tells him. This apparent 
Strength of the therapist is a support to the 
patient’s own ego strength. Thus, on this basis 
also, there is some increase in the patient’s 
ability to accept into consciousness potentially 
stressful material with increased chances of 
dealing with it constructively. Not only has a 
better basis for interpersonal communication 
been laid, but some facilitation in intrapersonal 
communication may occur. Whole therapies are 
based on providing an individual with an op- 
Portunity to talk out his problems in a non- 
threatening relationship. Some therapists have 
been so convinced of the effectiveness of the 


relationship itself that they have tended to 
renounce analytically derived free association 
and interpretation. One may hypothesize in 
terms of stress-defense dynamics that the as- 
pects of relationship described above are basic 
to any additional elaboration of technique. 
According to Fiedler (8), the “experts” of 
various schools tend to have greatest similarity 
at the point of managing the relationship 
aspect of therapy. 

The facets of the relationship between thera- 
pist and patient often become both subtle and 
complex. In good relationship therapy, the 
patient utilizes the relationship for a broader 
and more complete expression of his own re- 
jected feelings and tendencies. The therapist 
utilizes the relationship to aid the patient in 
becoming increasingly aware of the nature of 
his problems. Thus, according to the regulatory 
theory of consciousness, to become aware 
under conditions of reduced threat is to in- 
crease chances of improved self-regulation. 


Group RELATIONSHIP FACTORS 


The concept of ego strength has been defined 
as the ability to accept into awareness poten- 
tially stressful material and to deal with it 
constructively. It is hypothesized further that 
ego strength, as here defined, is largely a 
product of a sense of security and a sense of 
belonging in a significant way to a group of 
other human beings. If this part of the theo- 
retical frame of reference is accepted, several 
inferences are immediately derivable. Certainly 
one of the first steps in individual therapy is 
the extension of real acceptance by the thera- 
pist to the patient. Some feeling of being 
accepted and at least somewhat secure in the 
two-group relationship is necessary on the part 
of the patient or therapy cannot even get 
started.? An increase in group participation on 
the part of the patient during therapy is both 
diagnostic and therapeutic. This is an indica- 
tion of improvement and a utilization of rela- 
tionships that increase chances for further 
improvement. 

If ego strength is a product of a sense of 
security and belonging, then a look at home 
and community relationships—our typical cul- 
tural pattern—is not very reassuring. One 
needs only to think of the multitude of ways 


2 Cf. Bobbitt and Clausen (1). 
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by which the typical community rejects, neg- 
lects, avoids, or makes invidious distinctions 
among its members to realize how ego-crip- 
pling experiences can hardly be avoided. The 
positive side of this community picture often 
looks good, but most communities unwittingly 
set situations that result in tragic damage to 
some of its members. This is of extreme im- 
portance since the increase of population, 
together with a loss of frontiers, forces people 
to live in closer and closer contacts. People 
must learn to live together constructively or 
they will increasingly destroy each other. 


SUMMARY 


Psychotherapy is defined as a set of tech- 
niques, all of which should have as their com- 
mon purpose the progressive re-establishment 
in the individual of a more adequate degree of 
self-regulation. The pathologies of self-regula- 
tion have been ordered to a set of concepts 
referred to as stress-defense dynamics. The 
essential problem in psychotherapy is the 
recognition and appropriate manipulation of 
these variables in order to effect a maximum 
degree of reversal of pathological processes. 
This systematizing approach has been applied 
to a selected cluster of phenomena related to 
psychotherapy. These are as follows: communi- 
cation and psychotherapy, free association, 
insight, effects of “uncovering” and interpre- 
tation, abreaction, the relationship aspect of 
psychotherapy, and group relationship factors. 
Tt is assumed that the current style of being 
eclectic in psychotherapy demands that cer- 
tain basic concepts be systematically identi- 
fied so that the recognized common factors in 
psychotherapy shall have an organized mean- 
ing. The regulatory theory of consciousness 
and the derived concepts of stress-defense dy- 
namics are offered as a step in this direction. 
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THE EFFECTS OF FEAR AROUSAL AND SUPPRESSION OF FEAR 
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SEYMOUR FESHBACH 


AND ROBERT D. SINGER 


University of Pennsylvania 


HE PRESENT paper is concerned with 
some factors influencing the presence 
and extent of “projection” in social 
perception. The term “projection,” of psycho- 
analytic origin, has accumulated diverse con- 
Notations (4, 5) and has acquired considerable 
explanatory power at the cost of precise anal- 
ysis of the phenomena which it presumably 
explains. Thus, the paranoid who is defending 
against persecutory agents, the miser who 
perceives his neighbors as equally avaricious, 
and the fearful child whose world is full of dan- 
ger and threat are all said to be “projecting.” 
The present experiment is delimited to the 
study of one of the psychological processes 
underlying the various forms of projection; 
namely, the influence of affect upon perception. 
Our basic proposition is as follows: when a 
particular affective state is aroused, there is a 
tendency for the affect to become connected 
to contemporaneous percepts and ideation. 
One might describe this process as the infusion 
of cognition with affect, a process very similar 
to Heinz Werner’s concept of “physiognomic”’ 
perception (6). Freud (3) has also described a 
process of this kind and Bellak (1), in his 
theoretical analysis of apperceptive distortion, 
refers to the role of the mood of the perceiver. 
Infusion is conceived of as a basic, primitive 
organismic process that is especially character- 
istic of infants and young children. It is as- 
sumed that the infusion tendency persists to 
some degree in adults although greatly mod- 
ified by learning. One has to learn not to 
project, as it were; that is, learn to discriminate 
between one’s own feelings and those of other 
objects in the environment. Repression that 
tends to reduce the effects of higher-order 
learning (2) should result in greater projection 
or infusion. If a subject is unaware of or denies 
his feelings, then he is unable to employ an 
important cue to which he has previously 
learned to make discriminating responses. 
These considerations lead to the following 
specific hypotheses: 
1. Under conditions of fear arousal, a subject 
perceives (judges) a stimulus person as sig- 


nificantly more fearful and anxious than under 


neutral affective conditions. 

2. Under conditions of fear arousal, subjects 
encouraged to suppress recognition of their 
emotional reactions perceive a stimulus person 
as significantly more fearful and anxious than 
subjects encouraged to acknowledge their 
emotional reactions. 


METHOD 
Subjects 


Sixty male volunteers from introductory psychology 
classes at a large metropolitan university participated 
as subjects. The Ss were randomly assigned to one of 
the following experimental treatment groups: Control 
(C), Fear-Expression (FE) and Fear-Suppression (FS). 


Administration of Film and Fear Stimulus 


The Ss were seen individually. Each S was told 
that the experiment dealt with the effects of distraction 
upon the accuracy of one’s judgment of other people. 
They were informed that a film of a young man’s 
performance on a number of different tasks would be 
shown to them, and that after presentation of the 
film, they would be required to judge the personality 
of this man. Following these instructions, electrodes 
were attached to the left ankle of Ss in the Fear groups. 
Electric shocks, administered in the guise of distracting 
stimuli, were employed to induce fear. The strength of 
electric shock administered to each S was determined by 
gradually varying the intensity of the shock until S$ 
reported that the shock was painful. The film was 
then presented, and eight shocks were administered 
at varying intervals while the Ss witnessed the film. 
At the same time, they were required to remove a set 
of pins from the O’Connor Finger Dexterity Board, a 
task that could be readily accomplished with only occa- 
sional glances at the board. A similar procedure was 
used for the Control group with the important ex- 
ception that shock was omitted.! 


Manipulation of Fear Expression and Fear 
Suppression 


The instructions intended to encourage expression 
of fear or suppression and denial of fear were given 


1 In order to determine whether the pin removal 
task had any effect upon the Ss’ reactions to the film, 
half the Ss in the Control group were not given this 
task to perform. The means and variances of the 
subsequent judgments of the two Control subgroups 
were practically identical, and consequently the sub- 
groups were combined in analyzing the results. 
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prior to the showing of the film. The Ss in the Fear- 
Expression (FE) group were told: “Many people are 
disturbed by the shock. In order to perform most 
efficiently, the best thing to do is to be aware of and 
admit your feelings. If you freely express your feelings, 
your judgments will be more accurate.” The Ss in the 
Fear-Suppression (FS) group were given contrasting 
instructions as follows: “Many people are disturbed 
by the shock. In order to perform most efficiently, 
the best thing to do is keep your mind off your emo- 
tional reactions and not think about them. Try to 
forget about your feelings and concentrate on the task; 
by so doing, your judgments will be more accurate.” 


Measurement of Experimental Effects 


Subsequent to witnessing the film, Ss were admin- 
jstered a questionnaire regarding the personality 
characteristics of the stimulus person.? Five alternative 
choices were offered for each item in the questionnaire. 
The following four indices reflecting the evaluation of 
different personality attributes of the stimulus person 
were derived from the questionnaire responses: 

Indirect Fear Judgment score. This index, based on 
24 items, reflects the degree to which the stimulus 
person was judged as fearful or anxious in situations 
other than that depicted in the film. Sample questions: 
«How much tension do you think he usually works 
under?”; “How often do you think his sleep is fitful 
and disturbed?”’; “In comparison to most people, how 
much does physical pain upset him?” 

Direct Fear Judgment score. This index, based on 4 
items, indicates the degree to which the stimulus person 
was perceived as fearful and anxious in the actual 
test situation depicted in the film. In making direct 
fear judgments, the S has more stimulus information 
available than in making indirect fear judgments. 
Sample questions: “How worried do you think he was 
while working on the various tasks?”’; “How frequently 
did he seem to be anxious?” 

Indirect Aggression Judgment score. This index, 
based on 13 items, reflects the degree to which the 
stimulus person was perceived as a generally aggressive 
individual. Sample questions: “How frequently do you 
think he loses his temper?” ; “When he gets angry, how 
often do you think he takes it out on somebody else?” 

Negative Personality Judgment score. This index, 
based on 15 items, reflects the extent to which un- 
favorable personality characteristics other than anxiety 
or aggressiveness were attributed to the stimulus 
person. Sample questions: “How much reliance do 
you think one can place upon his word?”; “How tolerant 

is he likely to be of other people’s mistakes?” 

A further control was introduced during the com- 
pletion of the questionnaire in that the shock apparatus 
was detached from half of the Ss in each of the Fear- 


2A copy of this questionnaire has been deposited 
with the ‘American Documentation Institute, Order 
Document No. 5324 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress, 
Washington 25, D.C. remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks pay- 
able to Chief, Photoduplication Service, Library of 


Congress. 
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Arousal groups while for the remaining half, the shock 
apparatus was left intact and these Ss were given 
three shocks during this period. Since this condition 
did not result in any significant differences in subse- 
quent judgments, these two subgroups are combined 
in the presentation of the results. 


Apparatus 


The S was seated at a table situated nine feet from 
the screen on which the film was projected. The electric 
shocks were administered by means of a high impedance 
60-cycle inductive generator. The film, which lasted 
approximately 7 minutes, depicted a young man’s 
performance on a card-sorting task, the O’Connor 
Wiggly-Block Test, and the O’Connor Tweezer Dex- 
terity Test.? Neither sound track nor film titles were 
employed. The examiner administering the tests was 
not shown in the film except for a brief glimpse of his 
hands when he changed the test materials. The stimu- 
lus person, who had had previous practice with the 
various tasks, was instructed to complete each task as 
quickly as he could. 


RESULTS 


The means for all groups on each of the 
questionnaire subscales are presented in Table 
1. The results of an analysis of variance of the 
Indirect Fear Judgments are given in Table 2. 
The F ratio is significant at less than the .01 
level, the FS group having the highest Indirect 
Fear Judgment scores and the Control group, 
the lowest. The differences between each of the 
Fear-Arousal groups and the Control group 
are significant at the .01 level. The difference 
between the Fear-Suppression and the Fear- 
Expression groups, although in the predicted 
direction, is not statistically significant. How- 
ever, examination of the individual item scores 
revealed a substantial difference between the 
two groups in the extent to which they used 
the most extreme fear category (alternative 5) 
in judging the stimulus person. As Table 3 
indicates, 60 per cent of the FS group in con- 
trast to 25 per cent of the FE group judged the 
stimulus person as extremely fearful on at least 
one of the items. This difference is significant 
at the .05 level of confidence. Since the differ- 
ences between the Fear groups and the Contro 
group may be taken as measures of projection, 
the results, in support of the first hypothesis, 
indicate that Ss experiencing shock project 


3The authors wish to express their gratitude A 
Dr. William Nicholson who served as the princip? 
character in the film and to the staff of the F. A 
Taylor Management Laboratory of the hartie f 
School, University of Pennsylvania, who gave the 
time and facilities freely in processing the film. 
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their fear onto the stimulus person. There is 
also some support for the second hypothesis; 
namely, that suppression of fear facilitates the 
projection process. 

Included in the questionnaire were four items 
comprising the Direct Fear Judgment Index 
which required S to judge how worried or 
anxious the stimulus person appeared during 
the test situation depicted in the film. Analysis 
of these data yielded results in the same direc- 
tion as the previous findings, but the differ- 
ences were less striking. The data, based on 
only a few items, were much more variable 
than those for Indirect Fear Judgment and, as 
Table 4 indicates, the F ratio obtained is not 
significant. Comparisons between groups based 
on each of the four items comprising the Direct 
Fear Judgment Index revealed no significant 
differences between the FE and C groups and 
between the FE and FS groups. However, on 
one of these items, the difference between the 
FS and C groups is significant at the .05 level, 
on two of the items, at the .06 level, and, on the 
remaining item, the difference is not significant. 

The Ss’ perception of the aggressiveness of 
the stimulus person was considered in the study 
because of the possibility of so-called com- 
plementary projection and also because the 
same operations which aroused fear may also 
have aroused aggression. An analysis of var- 
jance of the Indirect Aggression Judgment 
scores is presented in Table 5. The F ratio is 
significant at the .01 level, the FS group having 
the highest aggression perception scores and 


TABLE 1 
COMPARISON OF MEAN SUBSCALE SCORES 
Group 
Bhs Rear RA if Es 
Subscale Contes! ston pres gee se y 
= 20) Gre (N= 
20) 20) 
Indirect fear 55.9 | 68.9 | 74.2 |2.7**/3.9**/1.1 
(24 items) 
Direct fear (4 8:5 | 9.7 | 11.2) O |2-1* 1.2 
items) 
Aggression (13 | 30.2 | 34.4 38.1 |1.8 |3.3**11.6 
items) 
Negative Per- | 47.4 | 44.6 | 46.3 | —| =|= 


sonality® (15 
items) 


® In the case of the Negative Personality Judgment items, the 
lower the mean, the more unfavorable the judgment. 

* Significant at the .05 level. 

** Significant at the .01 level. 


TABLE 2 
ANALYSIS OF VARIANCE OF THE INDIRECT FEAR 
JUDGMENTS 
Sum of Mean 
Source Squares ay Square 
Between groups 3546.55 | 2 | 1773.3 | 8.04** 
Within groups 12573.65 | 57 | 220.6 


** Significant at the .01 level. 


TABLE 3 


Use or EXTREME FEAR CATEGORIES ON INDIRECT 
FEAR JUDGMENT SUBSCALE 


Group Made 1 or More Made no 
Extreme Fear Extreme Fear 
Judgments Judgments 

Fear suppression 12 8 

(FS) 
Fear expression 5 15 

(FE) 

Note.—x? = 5.8; p < .05. 

TABLE 4 


ANALYSIS OF VARIANCE OF THE DIRECT 
FEAR JUDGMENTS 


Sum of Mean 


Source Squares af Square F 
Between groups 70.23 | 2 | 35.1 2.09 
Within groups 956.10 | 57 | 16.8 | (p< 0) 

> .10 


TABLE 5 
ANALYSIS OF VARIANCE OF THE AGGRESSION JUDGMENTS 
Sum of Mean 
Source Squares of Square 


Between groups 


633.0 2 ST S6 
Within groups 2 


3144.0 |57| 55. 


** Significant at the .01 level. 


the C group the lowest. Thus, it appears that 
aggression as well as fear was projected in this 
situation. 

In view of this finding, one might conjecture 
that the judgments of the Fear-Arousal groups 
are largely a reflection of a generalized negative 
attitude toward the experimenter and the 
experimental situation. If this were the case, 
one would expect the shocked Ss to be more 
likely than the C group to attribute negative 
personality characteristics (other than aggres- 
sion or fear) to the stimulus person. However, 
an analysis of variance of the Negative Per- 
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TABLE 6 
ANALYSIS OF VARIANCE OF THE NEGATIVE PERSONALITY 
JUDGMENTS 
Source Sum of df Mean F 
Squares Square 
7 44.6 2 22.3 56 
Aen a 2266.8 57 39.8 


Within groups 


sonality Judgment scores presented in Table 6 
indicates that differences among the three 
groups are slight and insignificant. These data 
indicate then that differences in perception 
between the experimental groups and the 
control group are specific to fear and aggression 
and cannot be attributed to a generalized 
“halo” effect. 

While not directly relevant to the hypotheses 
under study, the correlation between judg- 
ments of fearfulness (Indirect) and judgments 
of aggressiveness is of some interest. The 
Pearson r calculated for all groups combined is 
.68 which is significant at the .01 level. The 
individual correlations for the C, FE, and FS 
groups are .65, .43, and .72, respectively, and 
are also statistically significant. The fact that 
even in the control group there is a substantial 
relationship between the tendency to perceive 
an individual as anxious and the tendency to 
perceive him as aggressive indicates that the 
obtained correlations are not solely a result of 
the experimental operations. An interpretation 
of this correlation as a reflection of a favorable- 
unfavorable attitude toward the stimulus 
person is suggested by the significant negative 
correlation of —.51 obtained in the Control 
group between the Indirect Fear Judgment 
scores and the Negative Personality Judgment 
scores (the lower the score on this scale, the 
less favorable the judgment). 


DISCUSSION 


The results, when considered as a whole, lend 
substantial support to the hypothesis that the 
arousal of fear results in a tendency to perceive 
another person as fearful and anxious, at least 
within the limits of the experimental operations 
used to arouse fear and to measure its conse- 
quences. In evaluating the significance of these 
experimental results, it is important to note 
that although all Ss were oriented toward 
making accurate personality judgments of the 
stimulus figure, relatively few relevant cues 
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were available for making such judgments. In 
the case where more relevant cues were present, 
such as in those instances where Ss were asked 
to report the actual behavior or feelings of the 
stimulus person in the film situation, the effects 
of fear arousal, and particularly suppression of 
fear, are less striking than where indirect 
anxiety and fear were being judged. It has 
already been suggested that this difference 
may be a function of the difference in number 
of items and, hence, in the reliability of the 
scales used. The available data do not permit 
an assessment of the relative roles of statistical 
reliability and the constraints imposed by the 
stimulus, but there is sufficient theoretical and 
empirical basis for expecting greater perceptual 
distortion, or in this context, greater infusion, 
with reduction in stimulus information. 

Of considerable interest are the results 
bearing upon the hypothesis that suppression 
of fear facilitates the tendency to project fear 
onto other social objects. On the measures of 
both Indirect Fear Judgment and Direct Fear 
Judgment, the Fear-Suppression group at- 
tributes a greater degree of fear and anxiety to 
the stimulus person than does the Fear-Expres- 
sion group. When the two groups are compared 
with respect to the use of extreme fear cate- 
gories, the difference between them is statis- 
tically reliable. These data, as well as those 
for Indirect Aggression Judgment, are then 
consistent with the hypothesis that suppression 
of affect facilitates projection. However, the 
effects of this variable are not as clear-cut as 
the effects of fear arousal, per se, and the 
results do not unequivocally support this 
hypothesis. 

It must also be noted that the hypothesized 
explanation of the effects of suppression in 
terms of avoidance of cues essential for dis- 
crimination between the affective experience 
of the perceiver and the affect of the perceived 
object, is only one of several that can be offered 
to account for the experimental findings. Thus; 
although both experimental groups were reas- 
sured that many subjects are disturbed and 
upset by the experimental conditions, | the 
Suppression group may have felt addition@ 
anxiety because of inability to suppress their 
feelings and thereby conform with the a 
structions. Also, greater conflict in this groul 

. s level 9 
may have contributed to a greater of 
tension. Thus, the differences in amount 
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projection between the Fear-Suppression and 
Fear-Expression groups may have been func- 
tions of differences in the amount of fear and 
aggression experienced. Data pertinent to the 
issue of the mechanism responsible for the 
effects of suppression could be furnished by 
experimental research involving positive emo- 
tions such as sympathy and joy. However, 
whatever the mechanism by which suppression 
and other variables may influence projection, 
it is important that one first establish the par- 
ameters of the projection or infusion process. 
Thus, the present results indicate that while 
suppression or “repression” of affect is not 
essential in order for projection to take place, 
it enhances the degree of projection present.* 
It is instructive to examine the differences 
between the results of this study and those of 
Murray’s classic study of projection (4). 
Although both studies deal with the effects of 
fear, we obtained supplementary projection 
and possible complementary projection while 
Murray obtained complementary projection 
only; that is, the children of Murray’s study 
perceived the stimuli as malicious rather than 
fearful. An obvious explanation of this differ- 
ence lies in Murray’s procedure which involved 
rating of the pictures only with regard to 
maliciousness. However, we suggest that even 
if Murray had obtained judgments of fearful- 
ness, changes in the ratings of fearfulness would 
have been less marked than changes in the 
ratings of maliciousness. A critical variable in 
our view is the relationship between the 
perceiver and the stimulus person. In Murray’s 
study, the children were asked to judge pictures 
of adult males. Adults similar to the latter are 
likely to have had threatening roles in previous 
situations in which children have experienced 
fear, and consequently their faces were per- 
ceived as malicious. In terms of our theoretical 
framework, we would predict that if, under 


4It is possible to conceive of a circumstance in 
which “suppression” of affect might lead to the absence 
of projection. Thus, if the S were completely successful 
in restricting his responses to those which were incom- 
patible with fear responses, he might not experience 
any fear at all, and consequently he would not project 
since the present theory supposes that the response 
produced affective cues must be part of the stimulus 
field in order for projection to occur. The concept 
“denial” is perhaps more descriptive than are “sup- 
pression” or “inhibition” of the situation in which a 
particular affective response is elicited but the S is 
motivated not to recognize it as such. 


similar experimental conditions, pictures of 
children were used as stimuli, supplementary 
projection would be obtained; that is, the 
perception of the affective state of the stimulus 
person would directly match the affective 
state of the perceiver. 

These considerations serve to emphasize that 
the final judgment or perception of a social 
object is the outcome of a number of interacting - 
variables, one of which is the affective state of 
the perceiver. 


SUMMARY 


The present experiment was designed to 
study the effects of fear arousal and suppression 
of fear upon social perception. It was hypoth- 
esized that individuals subjected to a fear- 
producing situation would tend to project 
their feelings upon some social objects, and 
further, instructions to inhibit emotional reac- 
tions would increase the amount of projection. 

The Ss were randomly assigned to one of the 
following experimental treatment groups: Fear 
Expression, Fear Suppression, and Control. 
Shock was administered at varying intervals 
to Ss in both Fear groups while they witnessed 
a film of a young man’s performance of a num- 
ber of different tasks. Prior to presentation of 
the film, Ss in the Fear Expression group were 
encouraged to recognize any feelings they might 
experience because of the shock. In contrast, 
Ss in the Fear Suppression group were en- 
couraged to suppress and inhibit their emo- 
tional reactions. The Control group also witnes- 
sed the film but did not receive any shock. 
Subsequent to witnessing the film, all Ss com- 
pleted a questionnaire dealing with the per- 
sonality characteristics of the individual in the 
film. 

The following results were obtained: 

1. The Fear-Suppression and the Fear- 
Expression groups perceived the stimulus per- 
son as a significantly more fearful and also 
significantly more aggressive individual than 
did the Control group. 

2. The differences between the Fear-Suppres- 
sion and the Fear-Expression groups were 
consistently in the predicted direction. The 
difference between these groups in their mean 
fear judgment scores was not statistically sig- 
nificant. However, a significantly greater 
proportion of the Fear-Suppression group in 
comparison to the Fear-Expression group used 
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extreme fear categories in describing the 
stimulus person. 

3. The experimental differences are more 
striking for judgments of the fearfulness and 
anxiety of the stimulus person in situations 
other than that immediately depicted in the 
film. 

4. The tendency to perceive the stimulus 
person as a fearful and anxious individual was 
significantly correlated with the tendency to 
perceive him as aggressive. 

The data indicate that the arousal of fear 
results in a tendency to project fear onto a 
stimulus object in the environment. The 
results are also consistent with the hypothesis 
that suppression facilitates the tendency to 
project although, in this latter instance, one is 
less confident in rejecting the null hypothesis. 
Several alternative explanations of the effects 
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of the suppression variable were considered and 
the role of cognitive variables in the projection 
process was discussed. 
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THE MASCULINITY-FEMININITY DIMENSION IN NORMAL AND 
PSYCHOTIC SUBJECTS! 
MAX R. REED? 
Washington University, St. Louis 


ganization or structure to adjustment 

level or prognostic outcomes, the clini- 
cian must routinely attempt to assess the 
influence of the masculinity-femininity dimen- 
sion. Psychoanalytic theory (3, 7, 8, 20) as- 
signs a central role to this aspect of personal 
organization. Masculinity-femininity is viewed 
as a complex affair, derived from the combined 
effects of a number of personality determi- 
nants. Explicit relationships are postulated 
between genetic occurrences, body image or 
body feeling, and levels of ego organization (3, 
pp. 35-113, 418, 419). Early, severe disrup- 
tions in the sexual identification processes are 
thought to bring about major disturbances in 
body image organization (20, p. 252). The 
result is conceived as an admixture of cross- 
sex elements, a condition discordant with the 
body image essential to one’s normal sexual 
role. 

Such states are accounted for in psycho- 
analytic theory by postulating differences in 
levels of ego organization at the times at which 
early body image organization, primary iden- 
tifications, and secondary identifications occur. 
Body image organization and primary identi- 
fications are thought to occur at a preverbal 
level. Fenichel (3, p. 107) suggests that primary 
identifications and the deepest layer of the ego 
are constructed through kinesthetic, olfactory, 
and visual sensations. On the other hand, 
secondary identifications and the superego are 
later developments, based primarily on audi- 
Thus, there may be a wide dis- 
crepancy between the way a person knows that 
he should feel or act sexually and the way that 
he does feel or is capable of acting. These 
re believed to be important con- 
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tributors to a weakening of ego organization, 
the essential condition for psychotic illness. 

The development of tests reflecting different 
levels of masculinity-femininity organization 
has paved the way for experimental investiga- 
tion in this area. One line of development has 
made use of drawings of one type or another 
which, it is believed, reflect the basic mascu- 
linity-femininity organization. Franck and 
Rosen (6) devised a Drawing Completion Test 
(DC) which successfully differentiated college 
men and women. They reasoned that the dif- 
ferences in performance on this instrument 
reflected basic sex differences in body structure 
and psychosexual organization. The Draw-A- 
Person Test (DAP) is another drawing task 
believed to reflect sex differences in body image 
or body feeling and sexual organization. A 
number of studies (1, 4, 9, 11, 14, 15, 16, 22) 
with the DAP have yielded contradictory find- 
ings, but the test’s full potential is by no 
means exhausted. 

A second development has been the con- 
struction of verbal inventory measures which 
are thought to reflect the masculinity-femi- 
ninity aspect of personality organization. 
Prominent among these measures is the MMPI 
Masculinity-femininity Scale (Mf), an empiri- 
cally derived inventory yielding significant 
differences between men and women. Shepler 
(21) has shown that this test measures some- 
thing different from the DC. While he found 
that both the Mf scale and the DC test differ- 
entiated men and women, he also found a low 
and insignificant correlation between the two 
test variables. 

The DC, Mf, and the DAP were used in the 
present study to test the following hypotheses, 
derived from psychoanalytic theory, concern- 
ing the nature of differences in the masculinity- 
femininity organization of normal and psy- 
chotic women: 

1. Psychotic women obtain more masculine 
scores on measures of body image than do 
normal women. (a) With the DC as a measure 
of body image, psychotic women yield more 
masculine scores than do normal women on 
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With the DAP as another 


such measure, the relative body proportions 
of normal subjects’ male and female drawings 
approximate more closely actual anatomical 
measures of such proportions than do tis like 
body proportions of psychotic subjects’ figure 
drawings. 

2. The verbal statements of normal women 
concerning their masculinity-femininity status 
correspond more closely to their respective 
measures of body image status than do those 
of psychotic women. (a) Psychotic women 
show larger differences between their DC and 
Mf scores than do normal female subjects. 
(b) Psychotic women obtain higher (more 
masculine) DC scores than MMPI Mf scores, 
whereas normal female subjects obtain DC 
scores that are roughly equal to or lower (more 
feminine) than their corresponding Mf scores. 


this measure. (b) 


METHOD 


The study included an initial test of the 
hypotheses (designated Experiment I) and a 
replication study employing independently 
chosen samples and independent scorers (des- 
ignated Experiment IT). Experiment II was 
planned to examine any refinements of the 
original hypotheses which Experiment I might 
suggest and to cross validate the findings. 


Selection of Subjects 


The control Ss for Experiment I were 22 nurses 
chosen randomly from one class of nurses in training 
at the St. Louis State Hospital. The control Ss for 
Experiment II were 28 nurses chosen in the same man- 
ner from the immediately following class of nurses at 
the same institution. The mean age for these Ss in 
Experiment I was 20.63 years with a standard devia- 
tion of 2.49. For Experiment II the mean age was 
20.14 with a standard deviation of 1.19. All Ss were 
unmarried. 

Twenty-five female psychotic Ss were selected for 
each experiment. The majority of these Ss for Ex- 
periment I was obtained from the St. Louis State 
Hospital, and for Experiment II a majority was drawn 
from the Elgin State Hospital at Elgin, Illinois. The 
hotics were matched with the nurses on level of 
education, but it was not possible to match the two 
groups on the age variable. The mean age of the psy- 
chotic Ss of Experiment I was 29.28 years with a 
standard deviation of 3.21. In Experiment I, the 
mean age for the psychotic Ss was 28.52 years with a 
standard deviation of 3.54. In Experiment 1 fifteen Ss 
were married and ten single. Sixteen Ss were — 
and nine Ss were single in Experiment II. In Experi- 
ment I, the Ss had been hospitalized from a eae 
of one to a maximum of nine years A the ra 
length of hospitalization falling at 3.88 years. In 
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Experiment II, the length of hospitalization ranged 
from one to ten years with the mean at 3.96 years. 
The psychotic Ss in no case exhibited central nervous 
system involvement, nor had any S undergone shock 
therapy of any type within three months immediately 
preceding test administrations. 


Tests and Scoring Procedures 


All Ss were administered the DC, the group form of 
the MMPI, and the usual DAP procedure. 

The MMPI Mf. The published hand key was used 
to score the Mf scale of the MMPI for all Ss. 

The DC. The DC protocols were all scored according 
to the criteria given in Franck’s mimeographed manual 
(5). Although Franck and Rosen's (6) published scoring 
reliabilities on female normative groups are high, it 
seemed desirable to obtain interscorer reliabilities on 
the different populations used in the present study. 
When the writer and two additional scorers with 
graduate training in psychology scored the 25 psychotic 
records used in Experiment I, the resulting reliability 
coefficients were .79, .82, and .66. The writer’s scoring 
was used for all statistical evaluations in Experiment 
I. In Experiment II, the independent scoring of one 
of the graduate students who participated in the 
first experiment was used. 

In order to investigate Hypothesis 2 (b), DC standard 
scores were computed and the scale inverted for pur- 
poses of comparison with the MMPI Mf. Thus, for 
this comparison, a low DC standard score is in the 
feminine direction, whereas a low DC raw score is 
in the masculine direction. 

The DAP. The following body measures were taken 
from each figure drawing protocol: overall height, 
waist height, neck height, above waist height, and 
arm length. On the basis of Dickinson’s (2) findings 
that they reveal most clearly and consistently differ- 
ences between males and females in actual measure- 
ments, the single measure of overall height and the 
ratios of arm length/waist height, above waist /waist 
height, and waist height/neck height were employed 
to test Hypothesis 1 (b). In these measures, the pre- 
dicted proportional relationships for normals are as 
follows: (a) male figure drawing taller than corre- 
sponding female, (6) male proportions of arm length/ 
waist height larger than the like female figure propor- 
tion, (c) female proportion of above waist/waist height 
smaller than the respective male figure proportion, 
and (d) female proportion of waist height/neck height 
greater than the respective male figure proportion. An 
additional summary measure consisted of the number 
of Ss for whom all of the differences in body propor- 
tions between the male and female figure drawings 
were in the normal anatomical direction. 

Asa check on the reliability of these measures, inter- 
scorer correlations were obtained for the DAP proto- 
cols of the psychotic Ss employed in Experiment I 
Three scorers with graduate training in psychology 
measured all of these figure drawings. Product-moment 
correlations were calculated between the three sets of 
measures obtained. The resulting 30 reliability co- 
efficients ranged from .91 to .99. Consequently, only 
the E’s measures were used in the statistical tests of 
hypotheses. 
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TABLE 1 
Comparison or DC SCORES ror NURSES AND 
PsycnHorics 


Experiment I | Experiment II 
Item m |P | p 

Turse Sy- |N | +Sy- 

di chotics Nurses chotics 
No. of Cases 22 25 28 25 

Mean 23.36 | 15.60 | 20.14 | 15.00 

SD , 3.41 2.70 sE bs} 2.59 

Combined median 18.00 17.60 

No. above median 18 | 3 25 6 
No. below median 7 |17 3 19 
Chi square 8.546 8.902 
<.005 <.005 


a Hi TETEE E a ag 
* Higher scores are in “feminine” direction. 


RESULTS 


From Table 1 it is seen that the psychotic 
Ss in both experiments obtained significantly 
less feminine DC scores than the normal Ss. 
The nonparametric median test (18) yielded 
Significant chi-square values ( < .005) for 
the two comparisons. (All chi-square values 
obtained in this study were corrected for 
continuity.) 

While these findings strongly support the 
hypotheses advanced, the possibility must be 
considered that factors other than the experi- 
mental variable contributed to the differences 
in DC scores. Since it was not possible to 
equate control and experimental groups 
for age, product-moment correlations were 
calculated between the DC scores and the Ss’ 
ages. The values for the psychotic Ss were .20 
and .04 for Experiments I and IT respectively. 
For the nurses, the corresponding correlations 
were .42 and —.09. All correlations in the 
present study were tested for significance using 
the r-to-z transformation (17). None of these 
values are significant. Thus, there seems to be 
little appreciable influence of age on DC scores. 

Evaluation of the relationship of marital 
status to DC performance was limited by the 
nature of the samples. In Experiment I, the 10 
single psychotic Ss obtained a mean DC score 
of 13.7 with a standard deviation of 2.3, 
whereas the 15 married Ss obtained a mean of 
16.8 and a standard deviation of 2.2. In Ex- 
periment II, the 9 single Ss obtained a mean 
DC score of 14.3 with a standard deviation of 
3.1, and the 16 married Ss obtained a mean of 
15.3 with a standard deviation of 2.1. For 
Experiment I the difference is significant (¢ = 


3.41, p < .01), but for Experiment II the 
difference is nonsignificant (¢ = .79, p > .10). 
Although the absolute numerical means of the 
married Ss are more feminine than the DC 
means of the total psychotic groups, they are 
nevertheless significantly more masculine than 
the corresponding: nurses’ mean scores (Ex- 
periment I, / = 7.13, p < .01; Experiment II, 
t = 6.11, p < .01). Thus, in this study there 
is no conclusive evidence for a systematic re- 
lationship between DC scores and marital 
status. 

Length of hospitalization similarly seemed 
to have no relationship to DC performance 
(Experiment I, r = —.12, p > .05; Experi- 
ment II, r = .30, p > .05). Thus, there is no 
evidence that extraneous factors account for 
the obtained differences in DC test perform- 
ance. 

Two additional factors must be considered 
in evaluating the DC score differences ob- 
tained by the experimental and control groups. 
First, the nature of the test suggests the possi- 
bility that simplified responses, a result of 
impoverished psychotic ideation, might be the 
major source of the more masculine scores ob- 
tained by the experimental groups. However, 
a rough classification of the drawings into 
simplified versus several other categories re- 
vealed that only 27 per cent of the psychotic 
drawings scored as masculine could be con- 
sidered simplified. Since 25 per cent of the 
drawings scored as feminine could also be 
classed as simplified, there is little evidence 
that the more masculine scores of the experi- 
mental group resulted primarily from simpli- 
fied drawings. 

Second, in any group of Ss scoring similarly 
on the test, several different styles of drawings 
are found. This finding is in agreement with 
the results of Franck and Rosen (6). It is 
possible that such variations may be in some 
systematic way associated with different types 
of feminine organization of different adaptive 
values. Further work with the DC, aimed at 
refinements in the scoring system, should add 
to an understanding of the relationships of 
masculinity-femininity structure to levels of 
ego strength or adaptive capacity. 

The DAP results, presented in Tables 2 and 
3, give less convincing support for the first 
hypothesis. Only three of the ten chi-square 
values are significant. 
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TABLE 2 
Group COMPARISONS ON DAP MEASURES 


Max R. REED 


TABLE 4 
TEST INTERCORRELATIONS 


Experiment I 


Psy- 
Nurses | chotics Chi b 
DAP Measure Smite 
Fo | Mb | Fe | MP 
Height 13 9 7 |16 2.669 <.20 
Arm length/Waist height 11 | 1{10| 10) 4.088 | <.05 
Above waist/Waist height 18| 4| 12] 12) 3.811 <.06 
Waist height/Neck height a} 1)15| 8| 4355| <-05 
Experiment II 
Psy- 
Nurses i i 
DAP Measure aione ghi a 
Fs | Mb | Fa | M> 
TT a ad a 
i 8 | 14 2.706 +10 
Height 16 | 12 
Arm length/Waist height 12| 3| 9| 4] 0.427 <.70 
Above waist/Waist height |20| 2|12| 7 3.118 | <.07 
6| 2.814 <.08 


Waist height/Neck height 2| 1413 


a Subjects producing drawings yielding body proportions in 


normal anatomical relationships. 
b Subjects producing drawings distorted by cross-sex pro- 


portions. 


TABLE 3 
Group Comparisons ON COMBINED DAP MEASURES 


Experiment I Experiment II 


Item 
Nurses Psy- Nurses Psy- 
chotics chotics 


Number producing 100% 11 4 14 4 
body porportions in 
normal direction 
Number producing less 11 18 2 13 
than 100% body pro- 
portions in normal di- 
rection 
Chi square 3.642 
$ <.06 


5.430 
<.02 


Again, there was no systematic relationship 
between DAP measures and age. Product- 
moment correlations between the two variables 
ranged from —.44 to .31 with none of the 
values reaching significance. This finding is 
consistent with that of Lehner and Gunderson 
(14) that the figure drawings of women show 
no effects of age differences in the range from 
20 to 40 years. 

Inspection of Table 4 reveals no conclusive 
evidence for a systematic relationship between 
the DAP measures and other test scores. None 


of the correlations between the DAP measures 


and Mf scores are significant. Only one DAP 
measure yields a significant correlation with 


Experiment I Experiment II 


Measure Test 

Nur-| Psy- |Nur-| Psy- 

ses |chotics | ses | chotics 

Waist height DC 15) — .13 .00|— .56** 
Mí .04| .17 |—.10| .09 
Arm length DC .37|— .65**|— .54| .10 
Mf |—.23| .00 |—.11|—.29 

Neck height DC -08)— .20 .12|— .50** 


Mi | .07| .21 | :04|—.10 
DC |—114| :16 | -00 


Above waist š 
Mf |—.02|—.12 .08|— .08 


Height DC .07|— .96**| .04|— .42* 
Mf -09) .18 00) .32 
Correlation between —.21| .34 -07| .06 
Mf and DC 
Note.—Only measurements of female drawings appear in table. 
* p <.05. 
.. p <.01. 


TABLE 5 


COMPARISON OF EXPERIMENTAL Groups on DC-Mf 
STANDARD SCORES 


Experiment I Experiment IL 


Below Above 


Item Above Below 
Md. Md. Md. Md. 
Nurses 17 5 18 10 
Psychotics T 18 8 16 
Chi square® 8.546 4.950 
p <.005 <.04 


® Chi square based on median test. 


the DC scores in both experiments, and this 
only with the psychotic groups. Thus, there is 
considerable evidence in this study that the 
DAP is measuring something different from 
both the DC and Mf. 

In agreement with Hypothesis 2 (a), the 
psychotic Ss of both experiments obtain larger 
absolute difference scores (Mf standard scores 
minus DC standard scores). Table 5 presents 
the significant chi-square values obtained 
when the median test was applied to these 
differences. However, some clarification of this 
finding is needed. The result is due primarily 
to the more feminine DC scores obtained by 
the normal Ss rather than to any differences in 
Mf scores between the two groups. The M 
contributed little in discriminating the tw? 
groups. P 

A significantly larger number of psychotic 
than normal Ss obtained DC scores that were 
more masculine than their corresponding M 
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TABLE 6 
Group Comparisons on MMPI Mf SCALE 
(Mí measures in raw scores) 


Experiment I Experiment II 


Nurses Psychotics Nurses Psychotics 


N 28 25 

M 36.10 35.50 
SD 6.82 4.80 
t 1.045 

p >.30 


Scores, giving strong support for Hypothesis 
2 (b). Chi-square values for differences ob- 
tained in Experiments I and II, respectively, 
are 14.41, p < .001, and 4.05, p < 05. 

From Table 6 it can be seen that the Mf 
alone, in agreement with the findings of 
Renaud (19), is of little value in discriminating 
between major diagnostic groups differing 
greatly in sexual organization, and contributes 
little toward differentiating the normal and 
Psychotic groups. The findings of Gough (10) 
that psychotic Ss obtain more variable scores 
on the Mf scale than do normal Ss are not 
borne out. Product-moment correlations cal- 
culated between Mf and DC test scores give 
further evidence, in agreement with Shepler 
(21), that these two instruments assess differ- 
ent attributes. For the normal Ss, the correla- 
tions were —.21 and .07 for Experiments I and 
II respectively. For the psychotic Ss, the cor- 
responding correlations were .34 and .06. For 
samples of this size, these values may not be 
considered as different from zero. 

In order to evaluate better the relative 
efficiency of the two types of measures (DC 
scores and difference scores) in differentiating 
the criterion groups, the data were combined 
from the two experiments, and optimal cutting 
points were determined for the distributions of 
DC and difference scores. Table 7 shows that 
the DC scores alone discriminated much more 
effectively. The optimal DC cutting score of 
17.5 misplaced 18 Ss. Seven nurses (14 per 
cent) fell on the psychotic side of the distribu- 
tion. Eleven psychotic Ss, 22 per cent of the 
total psychotic group, fell in the nurses’ end of 
the distribution. The optimal difference score 
(Mf standard score minus DC standard score) 
of +-5.5 misplaced 33 Ss. Twenty-four nurses, 
48 per cent, fell at the psychotic end of the scale 
while nine, or 18 per cent, of the psychotic Ss 
Were misplaced. The correct selections for both 


TABLE 7 
DISTRIBUTION OF DC AND DIFFERENCE SCORES ABOUT 
OprmaL Curtinc Ports, COMBINING Data FROM 
ExpermMents I anD II 


DC Scores Difference Scores 
sea Above Below Above Below 
17.5 17.5 +5.5 +5.5 
Nurses 43 7 26 24 
Psychotics 11 39 9 41 
Chi square 39.668 12.362 
p <.001 <.001 


tests are significantly beyond chance expect- 
ancy (p< .001 in both instances). 

A word of caution is added here. Although 
some of the differences obtained between 
criterion groups in the present study are highly 
significant, considerable overlap occurs in all 
cases. Even though the two experiments were 
conducted on independently chosen groups, 
the samples were small and some drop is 
evident in the significance levels obtained in the 
second experiment. More extensive and refined 
investigation needs to be carried out in this area 
before these measures can be employed diag- 
nostically or for purposes of individual 
prediction. 


Discussion 


The results of this study are interpreted as 
being in general agreement with hypotheses 
derived from psychoanalytic theory (3, 7, 8, 
20) concerning the nature of differences along a 
masculinity-femininity dimension in normal 
and psychotic women. 

The evidence supports the hypothesis that 
there exist two rather clearly defined levels of 
functioning within this dimension. One level 
seems to be associated with or bound to 
kinesthetic or perceptual-motor facets of ex- 
pression and can be thought of as falling 
within the psychoanalytic id structure. The 
second level of organization appears to be more 
closely associated with verbal expression, and 
thus seems more closely associated with cul- 
tural expectations of the superego. For the 
psychotic groups in this study, disturbances at 
the nonverbal level of organization are crucial 
and are indicated by markedly masculine 
performances on the DC. It is suggested that 
such a deviant organization provides a core of 
unacceptable impulses and a steady source of 
life stress which contribute to the psychotic Ss’ 
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problems of adaptation and to their eventual 
breakdown. Extensive research is needed to lay 
the groundwork for a more refined and com- 
prehensive theory concerning this aspect of 
human development. 

Future research must also determine whether 
the differences in DC scores represent differ- 
ences in developments in the identification 
process or reflect some basic somatic dif- 
ferences, such as differences in hormonal func- 
tion, or both. In addition, the relationship of 
styles of sexual adjustment to general adaptive 
functioning, and the relationship between 
masculinity-femininity organization and psy- 
chosis in male populations need to be explored. 


SUMMARY 


The present study was designed to test 
hypotheses derived from psychoanalytic theory 
that psychotic and normal women differ in 
their responses to measures of different levels 
of masculinity-femininity organization. The 
DC and the DAP were employed as nonverbal 
measures of body image status, and the MMPI 
Mf Scale was administered as a verbal meas- 
ure, On the DC, the psychotic women obtain 
more masculine scores than do normals. How- 
ever, using the DAP as a measure of body 
image, only two of four measures yielded sig- 
nificant differences, and these did not hold up 
on cross validation. The verbal statements of 
normal women concerning masculinity-fem- 
ininity status correspond more closely to their 
respective measures of body image status than 
do those of psychotic women. ; 

A word of caution was added concerning the 
DC. Although some of the differences obtained 
with the test between criterion groups were 
highly significant, the amount of overlap was 
too great to allow its use for individual 
prediction. 

Theoretical implications of the study were 
discussed, and areas for future research 


indicated. 
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GALVANIC SKIN RESPONSE AS A MEASURE OF PATIENT'S 
REACTION TO THERAPIST’S PERMISSIVENESS! 


JAMES E. DITTES 


Yale University 


ost theories of the dynamics of psy- 

chotherapy agree on the importance 

of an attitude manifested by the 

therapist toward the patient, which is variously 

characterized as accepting, warm, nonthreaten- 

Ing, or permissive. But it is not always made 

clear what types of behavior define this “ac- 

Ceptance” or what is its function within the 
Situation of psychotherapy. 

“Acceptance” or “permissiveness” may be 
defined as an absence of various forms of 
Punishing behavior which a patient is accus- 
tomed to receive from other persons. The 
nature of such anticipated punishments differs 
among individuals but frequently includes 
Such cues as these: other persons fail to pay 
close attention to one’s words; others seem un- 
willing or unable to understand one’s own 
Point of view or remarks; one has difficulty 
making his points understood; not only do 
Others fail to give a fair hearing, but they also 
Insist vigorously on their own point of view, 
Interrupting, contradicting, disregarding and 
Not attempting to reconcile disagreement; they 
Merely announce their position, as though one’s 
own view isn’t even worth considering. Such 
types of behavior are punishing, in large part, 
because they communicate to a person that he 
is regarded with low esteem and as of negligible 
importance. Less subtle forms of such punish- 
ment are outright criticism, rebuke, expressions 
of rejection, and clear indications that one is 
considered inadequate in some respect and 
should change. Sympathy and friendliness 
may be expressed, but in a patently forced and 
unconvincing manner which says to a person 
that he is of little enough importance that he 
can be dealt with in a manipulative, contrived 
manner and not as an equal. 


1 This study is part of a psychotherapy research 
Program supported by United States Public Health 
Service Grant M-648 and supervised by Dr. John 
Dollard. The data used here had been previously ob- 
tained by Dr. Dollard and Dr. Frank Auld, Jr. Ap- 
Preciation is due Dr. Auld, Dr. Dollard, and Dr. 
George Mahl for making suggestions on the manu- 
Script and to Dr. Auld, Mrs. Ann Byler, and Mrs. 

ichele Morris for serving as judges in ratings of the 
Verbal material. 


Such types of behavior are functionally 
equivalent whether they are experienced as 
threats of punishment or as painful punish- 
ment in themselves, since such experiences be- 
come either threatening or painful as they be- 
come associated with clearly painful expe- 
riences, such as extreme rejection and 
withdrawal of important satisfactions; and 
whether as punishment or as threat of punish- 
ment, such cues motivate avoidance behavior. 
For example, the stimulus of having one’s 
words not attended to carefully may frequently 
be associated with the frustration of not 
receiving what has been requested or with such 
clearly painful experiences as being ignored or 
otherwise denied signs of regard or affection by 
the same person who has half-heartedly listened 
to one’s words. Such a cue comes to motivate 
avoidance whether it is regarded as itself 
punishing or merely as a signal threatening 
punishment. 

To such punishment or threats of punish- 
ment, the initial response of a person is likely 
to be some state of reaction which may be 
characterized as tension, anxiety, alertness, 
mobilization, a state which undoubtedly has a 
variety of physiological components, and which 
comes to be followed by, and thereby lessened 
by, some type of behavior which serves to 
escape or otherwise defend against the punish- 
ment. Within small groups, for example, it has 
been demonstrated that cues indicating low 
acceptance of a person by other group members 
results in increased conformity by that person 
to norms of the group, even in the face of evi- 
dence contradicting the norms (3). Among the 
types of behavior which are learned in response 
to such threats of socially administered punish- 
ment are probably those which are of interest 
to a therapist as neurotic symptoms and as 
defensive resistance within therapy. 

In therapy, ‘“‘permissiveness’’—the absence 
of such punishments and threats in the behav- 
ior of the therapist—should serve two related 
functions (4). First, the absence of such threat- 
ening cues should allow the patient to speak 
and act without eliciting in him the stress or 
tension and accompanying defensive avoidance 
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measures which are ordinarily evoked by such 
cues in the behavior of others. Second, when- 
ever either the therapist’s behavior or the 
patient’s own thoughts or other behavior 
actually do produce cues which have been 
accompanied by punishment in the past and 
hence serve as threats, the failure of these to be 
reinforced by actual punishment from the 
therapist should serve to extinguish the effec- 
tiveness of the cue as a threat: for example, the 
failure to be heard clearly will no longer elicit a 
state of tense readiness followed by some 
defensive measure; it will be found not to be a 
signal of punishment. ; 

In practice, it is unlikely that any behavior 
of the therapist, even the most permissive, 
completely avoids cues which are threatening 
to the patient. As a protective, adaptive meas- 
ure, a neurotic patient has probably learned to 
respond to a wide variety of cues as though 
they were threats and has acquired a “low 
threshold” of perception and response to such 
cues. . 

The resistive or defensive responses elicited 
by any such cues during therapy are probably 
difficult to detect and to measure for several 
reasons. There is much individual difference in 
the forms of such reactions. Any individual has 
a large repertory of such responses, and very 
different ones may be elicited by slightly dif- 
ferent situations. Because many forms of 
resistance and defense themselves are fre- 
quently punished, particularly the most overt, 
the patient learns to be very subtle inthe resist- 
ant response; the resistant, avoidant behavior 
—particularly of a well-practiced defender, as 
most patients are—probably deviates only 
very slightly, by the minimum possible, from 
ordinary nonresistive behavior, so that the 
resistance itself is most likely to escape de- 
tection and consequent punishment. 

A potentially more stable indicator of a 
patient’s perception and response to cues of 
threat is the physiological measure of the gal- 
vanic skin response. Probably alone among 
physiological responses, the GSR is activated 
solely by the sympathetic and not the para- 
sympathetic system and is generally character- 
ized as indicating a state of “mobilization” or 
“preparation” (1, 7). It seems plausible to sup- 
pose that the GSR reflects the general tension 

r “anxiety” or state of readiness which is the 
immediate reaction to a threatening cue. 


DITTES 


The GSR record of a patient in therapy, 
then, should show a relationship with the 
“permissiveness” of the therapist, as defined in 
terms of the absence of various punishments or 
cues threatening punishment, and measures of 
resistance should show an increase with de- 
creased permissiveness and increased GSR. 


METHOD 


This study is based on the records of the 
final 43 hours of therapy with a 36-year-old, 
white, male, neurotic patient, seen by an 
experienced therapist three or four times a 
week. The patient previously had 30 weekly 
sessions with the same therapist before therapy 
was recessed for the summer. The patient 
terminated therapy at the end of the series 
reported here. All sessions were electrically 
recorded and verbatim transcripts typed. A 
continuous record of the patient’s palmar skin 
conductance was made during each session 
with apparatus previously described (2). In 
one session, the 14th of the present series, 
recording equipment failed, and this is excluded 
from the study. 


Estimation of T) herapist’s Permissiveness 


Using the typed transcripts, two judges 
independently evaluated the remarks of the 
therapist, using ratings of one to four on each 
of four scales intended to represent aspects of 
permissiveness. Hours were judged in random 
order. The four scales were defined as follows: 


Understanding. The therapist shows that he under- 
stands the patient’s thoughts and feelings as the patient 
means and feels them. The patient has no trouble 
getting his points across; the therapist is not stupid 
or obtuse. The therapist’s interventions suggest 
thoughts or feelings which the patient can Tecognize as 
his own and do not seem to him farfetched, 

Gentleness. Therapist fails to show any of the signs 
of rejection or disapproval which the patient may have 
come to expect from others. He does not condemn, 
criticize, scold, judge, belittle, or use sarcasm or irony- 
He does not give evidence of a feeling of the patient’s 
inferiority by urging the patient to change some 
present behavior, by interrupting or trying to channel 
the patient’s speech, or by insisting on his own inter- 
pretations either openly or subtly; he speaks tentatively 
and offers his notions only as a possibility for the 
patient to consider. 


OO r N] 


GSR AND REACTION TO THERAPIST’s PERMISSIVENESS 297 


, Over-all acceptance. The total impact of the thera- 
Pist’s activity is likely to make the patient feel that 
a is Unconditionally accepted, respected, valued, 

loved,” is an Important person, not under judgment 
or threat of potential punishment, is “safe” to venture 
thoughts and feelings which have been punished in 
his Previous relations with other people. 


Judgments on each scale were made for the 
total hour and for smaller units of the ther- 
apist’s remarks within each hour. A unit was 
defined as all the remarks of the therapist 
Separated on the typescript by less than a page 
of the patient’s remarks. The therapist units, 
other words, were separated from each other 
bya page or more of the patient’s speech. Most 
hours had from four to six such units. Agree- 
ment between the two judges in judging the 
Whole hours with the four scales summed 
yielded a correlation of .69%, and the correlation 
of judgments in the smaller units within hours 
Was .61. Four principal scores were used. 

Whole hour permissiveness is the sum for the 
four Scales of judgments of the hour as a whole, 
With both judges’ scores summed. Scores 
Tanged from 15 to 32 with the median and mean 
at 21. This score was used as the measure of 
permissiveness in all results reported unless 
Some other score was specifically indicated. 

Initial permissiveness was measured by av- 
€raging the ratings on all therapist units in the 
first five pages of typescript. 

Opening friendliness. The behavior of the 
therapist in greeting the patient and in any 
interaction during application of the GSR 
electrodes was judged by two judges with 
respect to “friendliness,” defined as responsive- 
ness to incidental social overtures of the 
patient or as initiation of such conversation, 
Independent ranking of 36 hours by two judges 
yielded a Spearman rank-order correlation of 
-87. The six remaining hours were those both 
Judges eliminated from the sorting as providing 
insufficient basis for judging; in most of these, 
only a bare greeting was recorded. The average 
tank of the two judges was used as the score 
for the hour on this measure. 

Change in permissiveness. Instancesof marked 
change in permissiveness within any hour were 
defined as any change of four or more points 
(representing an average of one or more points 
for each of the four scales) between two succes- 


* All correlation coefficients reported in this paper 
ae Product-moment correlations except as otherwise 
‘oted. 


sive therapist units. Nineteen such instances 
occurred in the 42 hours. 


Scoring of GSR Record 


A GSR was scored whenever the continuous 
record of conductance met one of the following 
criteria: (a) a rise at the rate of 4000 or more 
ohms per second with an amplitude of at least 
6500 ohms; or (b) a rise of at least 10,000 ohms 
amplitude at a rate of 1333 or more ohms per 
second. Such a change was of the order of 3 
per cent of the basal resistance level. These 
criteria had been used previously (2) and were 
found to be easily distinguishable with the 
calibrated paper used; and had been used to 
distinguish between clear-cut abrupt rises in 
conductance and much smaller fluctuations. 

The GSR measure primarily used in this 
study was the frequency of such rises for each 
50-minute therapy session. This frequency was 
approximately normally distributed with a 
mean of 45 and median at 43, ranging from 11 
to 90. In a randomly selected set of half the 
hours, this frequency score correlated .95 with 
the total amplitude of GSR deflections within 
each hour. In the same sample of hours, the 
GSR frequency correlated —.42 with the 
amount of decrease during the hour of the 
basal resistance level, as calculated by dividing 
the amount of change in ohms by the beginning 
resistance in ohms. 

For obtaining a GSR score within any part 
of an hour, the rate of GSRs per minute was 
used. This varied between .25 and 3.2 GSRs 
per minute in the hours examined. The cri- 
terion for determining whether a change in rate 
occurred was a difference of .25 or more GSRs 
per minute between successive portions of an 
hour. 

To measure GSR rate following any therapist 
intervention, GSRs were counted from five 
minutes after the beginning of the therapist 
unit until five minutes after the beginning of 
the next unit, and the rate per minute com- 
puted. It was reasoned that the effect of the 
therapist’s speech on the GSR would appear 
only after the patient had heard a substantial 
amount of the unit. The time of five minutes 
was arbitrarily selected before the data were 
examined. The unit of the therapist’s speech 
typically covered somewhat more than this 
period. 

To measure the GSR preceding a therapist 
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unit, GSRs were counted from the beginning of 
the preceding therapist unit to the beginning 
of the relevant therapist unit, and the rate per 
minute computed. It was assumed that this 
period best represents the unit of patient 
behavior which might be suspected of affecting 
the behavior of the therapist in the subsequent 
unit. 


Measures of Patient’s Behavior 


Embarrassing sex statements. Two judges, 

working independently from the transcripts, 
identified any statement by the patient clearly 
acknowledging personal sexual desires and 
behavior. Reliability for these judgments by 
these judges has previously been shown to be 
represented by phi coefficients of the order of 
.80 (2). A statement was scored as an embar- 
rassing sex statement when selected by both 
judges. 
a Two judges independently rated 
the patient’s remarks as to whether they dealt 
with emotionally significant material or were 
resistive in the form of discussing less emo- 
tionally significant material, such as mechanics 
of therapy, trivial daily activity without emo- 
tional involvement, or another person’s prob- 
lems. The purpose of this measure was to check 
the possibility that a friendly, accepting man- 
ner by the therapist could be related toa low 
GSR score because it encouraged the patient to 
behave more as though he were in a friendly 
social conversation and hence discuss material 
less likely to involve emotions and the GSR. 
Ratings from 1 to 4 were made for each third 
of an hour and for the whole hour. The sum of 
these four judgments by both judges was taken 
as the resistance score. This score ranged be- 
tween 8 and 24 with a mean of 17 and median 
at 18. Agreement between two judges yielded 
a correlation coefficient of .40. 

Interruplions in patients speech. As another 
possible indication of the kind of resistance 
which might be encouraged by a friendly 
therapist, a count was made of the number of 
times in each hour that the patient paused in 
his speech for five seconds or more. It was as- 
sumed that more fluent, less interrupted speech 
might accompany more sociable and less emo- 
tionally significant remarks and that more fre- 
quent pauses might accompany more sig- 
nificant material. Although using a slightly 
different indication of fluency and silence, Mahl 
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(5) has demonstrated that increased silence is 
associated with increased anxiety and that 
decreased silence is apparently an indication 
that other resistances than silence are operat- 
ing. In the present study, the correlation of this 
score, number of pauses, with the judgment 
of resistance described above is —.29 (p = .06), 
i.e, higher resistance is associated with fewer 
pauses. 

The frequency of pauses was also used in the 


analysis as a possible indication of depression 
in the patient. 


RESULTS 
Relation between Permissiveness and GSR 


Each of the indicators of therapist’s permis- 
siveness showed a significant inverse relation- 
ship with frequency of galvanic skin response. 
Table 1 reports product-moment correlations 
between the measures. 

Since it was also found that the GSR is 
related to the emotional significance of the 
patient’s speech, as is reported in more detail 
below, this relationship partially obscures the 
relationship between therapist’s permissiveness 
and GSR. If the emotional significance of the 
patient’s remarks is made more homogeneous 
by eliminating the hours most extreme in 
resistance and those most extreme in emotional 
significance of content, the relationship be- 
tween permissiveness and GSR becomes 
sharper. The last two columns of Table 1 
report the correlation between permissiveness 
and GSR, disregarding, in one column, the 
hours which represent the top and bottom 
eighths on the distribution of resistance scores, 
and in the other column, the hours which 
represent the top and bottom eighths on the 
distribution of pause scores. 

Changes in permissiveness from one hour to 


TABLE 1 
RELATIONSHIP BETWEEN PERMISSIVENESS AND GSR 


Product-moment Correlations* 


pietas i Be 
Measure of Permissiveness Peele Pateh 
All Hours|—————— 

Resist- | Pauses 

ance 

Whole hour permissiveness | —.51 | —.65 | — -73 
Initial permissiveness —.41 | —.44 | —-49 
Opening friendliness —.52 | —.58 | —-59 


* All relationships are significant at beyond the .01 level. 


Le 
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the next were accompanied by expected 
changes in the frequency of GSR. Fourteeen 
hours showed a decrease of two or more points 
im permissiveness from the next previous hour. 
Nine of these showed an increase in frequency 
of GSR, four a decrease, and one no change. Of 
14 hours showing an increase in permissiveness 
of two or more points, three increased in GSR 
frequency and 11 decreased. This relationship 
yields a chi square of 7.0 (p < .01). This anal- 
ysis disregards the three hours which, because 
of vacation or illness, followed the previous 
hour by a lapse of a week or more. 

Eighteen embarrassing sex statements oc- 
curred, distributed among eight of the hours. 
A significant tendency was found for the ESS 
to be accompanied by a GSR in hours scored 
low on permissiveness and not to be accompa- 
nied by a GSR in more permissive hours. 

Of 10 ESSs in four hours above the median 
on whole hour permissiveness, four are ac- 
companied by a GSR and six are not. Of eight 
ESSs in hours below the median on permis- 
Siveness, seven are accompanied by GSR and 
one is not. This yields a chi square of 4.1 (p < 
-0S). If the ESSs are sorted at the median ac- 
cording to the permissiveness score of the 
therapist unit immediately preceding the ESS, 
the relationship shows a chi square of 5.8 
(p < .02). If opening friendliness is used as the 
Measure of permissiveness, the relationship 
yields a chi square of 7.1 (p < .01). 

Although there is evidence that judges agree 
on the implicit unitizing in scoring ESSs (2), 
the unit of an ESS is not explicitly defined, and 
there accordingly exists the possibility of bias 
in determining whether or not a GSR accom- 
panies a particular ESS. Therefore, an alter- 
native analysis was made, using as an objec- 
tively defined unit of the patient’s speech all 
his remarks occurring between pauses of five 
seconds or more or between remarks by the 
therapist. The rate of GSR per minute during 
speeches in which an ESS occurred was cal- 
culated. The average rate of GSRs for all the 
periods of ESS speech in hours below the 
median in permissiveness is 1.8 and for hours 
above the median is .8. The difference in rates 
between the two sets of ESS speeches is shown 
by a / test to be significant beyond the .05 level. 

The four subscales originally intended to 
define permissiveness proved not to contribute 
Equally to the relationship with GSR. The 
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TABLE 2 
RELIABILITIES AND INTERCORRELATIONS OF FOUR 
PERMISSIVENESS SUBSCALES WITH REGRESSION 
WEIGHTS FOR MULTIPLE CORRELATION 


wita GSR 
| 
| Intercor- | Regres- 
Correla- | relations | Correla- | son 
Subscale esa =a With) Weight 
Judges | Ë | gi g | GSR With 
| |5 [ojs] 
Attentiveness 7} .35].50) — .38 24 
Understanding |.48|.57) —.32 | —.03 
Gentleness | 85 —.51 -42 
Acceptance | | —.48 .02 
multiple correlation between the four scales 


and GSR is —.55. The interjudge reliability 
for each scale, the intercorrelations, and the 
regression weights are shown in Table 2. The 
gentleness scale alone correlates as well with 
the GSR (r = —.51) as does the sum of the 
four scales. Apparently, understanding and 
over-all acceptance are not related to GSR 
except as they comprise the factor which may 
be labeled asgentlenessora nonpunishing man- 
ner, the chief component of permissiveness 
with which the GSR is related. 


Alternative Interpretations 


The general interpretation of these results is 
that the degree of apparent permissiveness or 
threat in the behavior of the therapist is re- 
sponded to by the patient in a way which may 
be measured by the GSR. Before this inter- 
pretation is discussed in more detail, several 
alternative interpretations will be considered 
with additional results reported as they pertain 
to each one. 

It could be contended that the GSR and the 
therapist’s manner are both consequences of 
some aspect of the patient’s behavior. For 
example, it may be suspected that on occasions 
when the patient is depressed, the GSR re- 
flects this fact with a lower frequency and that 
the therapist also responds to this depression 
with a more kindly, sympathetic manner, which 
is judged more permissive. Or perhaps the 
patient comes to certain hours with a particular 
anxiety which results in more GSRs and which 
also results in a resistive kind of behavior with 
which the therapist attempts to deal by using 
more radical and direct interventions, which 
are judged less permissive. 

There is evidence, however, that variations 
of permissiveness within hours are followed 
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INCREASE IN 
GSR RATE 
NO CHANGE IN ma 
GSR RATE Cr E 


DECREASE IN 
GSR RATE 


DECREASE IN 
PERMISSIVENESS 


INCREASE IN 
PERMISSIVENESS 


Fic. 1. EFFECT OF CHANGES IN THERAPIST'S 
PERMISSIVENESS ON SUBSEQUENT GSR RATE 


DECREASE IN 
GSR RATE 


INCREASE IN 
PERMISSIVENESS 


DECREASE IN 
PERMISSIVENESS 


INCREASE IN 
GSR RATE 


Fic, 2. Errect or CHANGES IN GSR RATE ON 


Trerapist’s SUBSEQUENT PERMISSIVENESS — 
Omitted are approximately 60 instances in which a 
change in the GSR rate is followed by no change in 
permissiveness. 


but nol preceded by corresponding inverse 
variations of the GSR. These data are reported 
in Figures 1 and 2, which record all within-hour 
changes of four or more points in permissiveness 
between two successive therapist units. The 
tendency of a change in permissiveness to be 
followed by a change in rate of GSR in the 
direction predicted by the interpretation held 
here yields a chi square of 2.7 (p = .05, one 
tail). But there is no relationship between 
changes in permissiveness and changes in the 
preceding GSR rate. These results support the 
interpretation that the GSR is in fact a 
response to the therapist’s behavior. — 

With reference to the more specific pos- 
sibilities that either depression or resistance of 
the patient affects both GSR and permissive- 
ness, a possible measure of depression, fre- 
quency of pauses, though correlated with 
permissiveness (r = .38, p < .02) is not cor- 
related with frequency of GSR (r = .14). 
Resistance as judged in this study is corre- 
lated inversely with permissiveness (r = —.53, 
p < 01) and only slightly with frequency of 
GSR (r = .22). The frequency of pauses score 
just reported may also be considered as an 
indication of resistance. 
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Another type of interpretation would explain 
the relationship of permissiveness and GSR as 
an artifact. It might be suspected that the 
therapist was judged more permissive and 
friendly when he talked more, thus leaving the 
patient less time to think and speak his own 
thoughts, and that listening is less likely to 
produce GSRs than is speaking. However, a 
count of the frequency of the therapist’s speech 
(number of times therapist spoke between 
utterances of the patient) shows a negative 
correlation with permissiveness (r = —.33, 
p < .05) and no relation with GSR frequency 
(r = .07) 

Perhaps the most likely alternative explana- 
tion is that permissiveness, as defined here, 
encourages certain kinds of resistance by 
making the patient feel that the therapy is 
more like a social occasion, leading him to 
speak of routine, less emotionally involved 
material; hence, fewer GSRs are to be expected. 
Three types of evidence are available on this 
point. 

First, this type of resistance was judged with 
moderate reliability between two judges but 
showed little relation with GSR (r = .22), and 
furthermore yielded a negative relationship 
with permissiveness (r = —.53) contrary to 
that proposed by the explanation. 

Second, the other measure of resistance, 
number of pauses by the patient, was also 
found related to permissiveness (r = .38, 
p < .02) but not to GSR (r = 14). 

Third, variability in the patient’s expression 
of emotionally significant material was pre- 
sumably held constant in the judging of embar- 
Tassing sex statements, andas reported above, & 
clear relationship between permissiveness and 
GSR remains when such a uniform type of 
expression is examined. Of further suggestive 
importance is the fact that ESSs occur as fre- 
quently in permissive as in nonpermissive 
hours, contrary to what would be expected by 
the alternative explanation. ESSs did not, 
however, occur in hours of moderate permis- 
siveness. All of the ESSs occurred in hours 
which rated either in the upper or lower thirds 
on the distribution of permissiveness. 


Relation of Resistance with Permissiveness and 


GSR 


Both measures of resistance show that in 
hours of moderate permissiveness (permissive- 
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TABLE 3 
RELATION or RESISTANCE AND GSR at DIFFERENT 
LEVELS OF PERMISSIVENESS 


Correlation of GSR with 


», Degree of Number of 
Permissiveness Hours Judged _ Frequency of 
Resistance* Pauses 
High 12 02 33 
Moderate 15 -48 -67 
Low 15 —.15 -16 
All hours 42 = 22 14 


a Sig A i A 
a Signs of the resistance judgments are reversed so that, as with 
h t i IS 

€ pause score, a high score represents more emotionally signifi- 
cant, less resistive, content. 


hess scores within two points of the mean), 
there is a clear relationship between frequency 
of GSR and the extent to which the patient’s 
Speech comprises emotionally significant, non- 
resistive content. These correlations are report- 
ed in the second line of Table 3. Within this 
Same group of hours, the correlation of GSR 
and permissiveness is slightly positive (r = 
-16); presumably the over-all trend of an in- 
verse relationship between GSR and permis- 
Siveness is obscured by the strong relationship 
of GSR and content of patient’s speech in these 
hours. The relation of GSR and emotionally 
Significant content disappears when permis- 
siveness becomes more extreme in either di- 
rection and presumably becomes the stronger 
determinant of the GSR as shown in the first 
and third lines of Table 3. 

Each measure of resistance shows a sig- 
nificant correlation with permissiveness over 
all the hours. Correlation of permissiveness 
with judged resistance is —.53 and with 
pauses is .38, in each case indicating more 
emotionally significant, less resistive content 
accompanying higher permissiveness. 


Discussion 


The behavior of a therapist who preserves, 
in general, a nonpunishing, warm, accepting 
relationship with a patient may, nevertheless, 
include detectable variability in this character- 
istic, even within a narrow range, and offers 
to the patient at least occasional cues which 
he may regard as threatening or punishing. 
Such cues would include any behavior which 
may, correctly or incorrectly, communicate to 
the patient that the therapist regards him as an 
inferior, inadequate, or otherwise unworthy 
and unrespected person. Apparent failure by 
the therapist to understand the patient’s words 


301 


or his interruption of the patient, or some form 
of scolding, or his apparent determination to 
force his own interpretations and points of view 
on the patient may be among such cues. 

It appears confirmed by this study that a 
patient may react to such cues with a response 
which includes the galvanic skin response and 
which may represent a general tenseness and 
a defensive readiness to cope with the danger. 

An episode occurring in one hour provides 
illustration and strong support for these 
interpretations. Hour 46, the contents of which 
were unknown to the investigator until near 
the end of the study, marks a distinctive 
change in the frequencies of GSR. Of the 14 
preceding hours, only three have GSR fre- 
quencies under 50. Of the 28 following, only 
four have GSR frequencies over 50. A rank 
correlation (rho = .49, p < .01) was found 
between sequence and GSR frequency, but 
this is due almost entirely to the change at 
this hour. Rank correlation for hours 46 and 
after is —.01 and for hours before 46 is .23. 
(There is a slight correlation of .24 between 
permissiveness and sequence, but this is not 
significant, and there is no corresponding 
sharp difference at hour 46.) 

About one-third of hour 46 is devoted to 
interpretations by the therapist concerning his 
apparent friendliness to the patient and to 
responses by the patient on this subject. In 
response to the patient’s observation that he 
can’t communicate as freely as he consciously 
wants to, the therapist remarks, “I think you 
have to like somebody if you want to com- 
municate. You have to feel free. . . Who’s will- 
ing to confide or tell their private thoughts to 
other than a friendly person?” The patient, in 
effect, confirms the implications of the inter- 
pretation by reporting that he has felt the 
therapist to be brusque and unfriendly, as 
evidenced by his failure to call the patient by 
his first name and his reluctance to engage in 
social banter before and after the therapy ses- 
sions. He reports his feelings during an episode 
in the previous day’s therapy when, “I was so 
anxious or falling all over myself. . . trying to 
impress you. . . as though I were rushing in to 
fill a gap that seemed to be necessary to fill to 
maintain a relationship, for instance. .. . I say 
that type of thing often where I feel that I 
got to rush in and patch up a rupture rather 
than widen the rupture or act natural or expose 
myself.” 
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These words of the patient seem to support 
the theoretical interpretation offered in this 
paper. In speaking of the “relationship,” he 
presumably refers to various cues representing 
the therapist’s regard and esteem for him, cues 
on which he is highly dependent. He indicates 
that the absence of such cues becomes a painful 
experience and motivates frantic efforts to 
defend against such a possibility. Such cues 
as not being called by his first name in spite of 
a long acquaintance and not finding a friendly 
response to efforts to initiate light social con- 
versation apparently have been responded to 
by the patient as though they were indications 
of absence of esteem or some other punishing 
attitude, as in fact they normally would be in 
most social experience, and this response has 
included high GSR activity. Apparently, the 
effect of the therapist’s interpretation and the 
patient’s reply has been to let the patient 
realize that the therapist, unlike most other 
persons in his social experience, may be im- 
personal without being hostile or punishing. 
These particular cues in the therapist’s 
behavior, though they continue to exist, lose 
their effectiveness as threats, and the GSR 
activity immediately drops and varies around 
a lower level. , 

The preliminary attempt to separate differ- 
ent aspects of permissiveness on separate 
scales results in the finding that absence of 
certain types of social punishment, defined 
here as “gentleness,” proved to have the most 
significant relation with the GSR. In view of 
the unique and special emphasis sometimes 
given to “understanding” as the expression of 
a therapist’s “acceptance” (6), it is of some 
interest to note that, as judged here and for 
this patient, this aspect of permissiveness is 
not related to the GSR independently of the 
scale which is most explicitly defined as the 
absence of punishment. ahs 

The judgment of resistance in this study was 
made with only moderate reliability and cor- 
relates only moderately with the objective 
measure of resistance, pause frequency. How- 
ever, the two independent measures are highly 
consistent in their relationship with GSR 
frequency and with permissiveness and yield 
results that corroborate the theoretical inter- 
pretations proposed above. 

If certain cues in the therapist’s behavior 
are responded to with anxiety and fear of 
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punishment, it would be expected that such 
anxiety or fear would be further increased 
when the patient’s own behavior or speech in- 
cludes characteristics that may seem more 
“punishable” than when his behavior or speech 
is more “innocent.” The combination of some 
cues in his own behavior that have been fre- 
quently followed by punishment plus cues in 
the therapist’s behavior that threaten punish- 
ment should evoke more anxiety (and GSR) 
than the latter cues alone, Personal behavior 
which may have been punished in some way 
by others would include the discussion of sexual, 
hostile, and other feelings toward intimately 
related persons—behavior, in other words, 


which was judged in this study as emotionally 
significant, less resistive. 


Within the limited range of moderate per- 
missiveness, this is precisely the relation which 
was found: the emotional significance of the 
patient’s speech is significantly correlated (7 = 
48 or .67, depending on which measure is 
used) with the GSR, and is a more important 
determinant of the GSR than is permissive- 
ness within these hours. 

At lower levels of permissiveness, however, 
the findings are different. GSR is high and 
resistance is high (both being clearly corre- 
lated negatively with permissiveness), but the 
GSR and resistance are not correlated. These 
results suggest that anxiety (and GSR) are so 
uniformly high in response to the low permis- 
Siveness that they evoke generally resistive, 
nonintimate speech by the patient. The GSR 
is attributable chiefly to the low permissive- 
ness, and not to emotionally significant 
content, because the latter is not present. 

At high levels of permissiveness, GSR and 
resistance are both low, and again not signifi- 
cantly related with each other. Apparently, the 
permissiveness js great enough that the patient 
Speaks freely of emotionally important mate- 
rial and does so without feeling under any 
threat. The cues of discussing emotionally vital 
matters may evoke GSR only as these cues 
enter into a kind of multiplicative relationship 
with cues of threatening behavior by the 
therapist. The results obtained with the ESS 


judgments further support these interpreta- 
tions. 


SUMMARY 


. i e- 
Frequency of a patient’s galvanic skin the 
Sponse is shown to be inversely related to 
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judged permissiveness of the therapist during 
a series of 42 hours of psychotherapy. The 
ISR is interpreted as a measure of the anxiety 
of the patient, or his “mobilization” against 
any cue threatening punishment by the thera- 
PISt, such as any cue which the patient has 
learned to perceive as evidence of low esteem. 
This anxiety presumably motivates varying 
Subtle forms of resistance and defense against 
the feared punishment. Several alternative in- 
terpretations of the relation between permis- 
Siveness and GSR are rejected by various 
findings. 

Two different measures of resistance are 
Negatively correlated with the permissiveness 
of the therapist. However, the attempt to show 
that f this relationship is mediated by the 

anxiety” or “readiness” which the GSR meas- 
ures is obscured by the finding that the GSR 
1S related to the emotional significance of the 
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patient’s speech, as well as to the permissive- 


ness of the therapist. 
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THE INFLUENCE OF THE MOTHER’S PRESENCE ON CHILDREN’S 
DOLL PLAY AGGRESSION! 


HARRY LEVIN anp VALERIE F. TURGEON 
Cornell University 


URING the course of socialization, the 

mother teaches her child skills, beliefs, 

values, and brings under control the 
child’s free expression of his impulses. Among 
the impulses to be controlled, aggression pre- 
sents special problems. For one, the very 
nature of training creates frustrations that are 
likely to result in aggressive behavior. Such acts 
of aggression must in turn be controlled, a 
process likely to instigate more of the change- 
worthy behaviors, a sort of vicious circle. The 
socializer thereby becomes an instigator to the 
very action she is trying to change. 

In most instances, however, mothers do 
succeed in bringing under control the aggres- 
siveness they deplore. Through the punishment 
of aggression, the conditions that instigate ag- 
gression also instigate the anticipation of the 
pain that has, in the past, accompanied such 
actions. We expect that when the inhibitions 
are greater in strength than the urges to be 
aggressive, the child will not behave directly 
in the disapproved manner. The mother, then, 
comes to both instigate and to inhibit the 
child’s aggressions. 

Assume that the training at home is the 
point of origin on a continuum of stimulus 
generalization. Following Miller’s displacement 
model (4), the greater the difference of a given 
stimulus situation from the relevant aspects of 
the home, the greater the probability of the 
overt occurrence of aggression, within the limit 
that aggression is still instigated. Sears and his 
co-workers (e.g., 2, 7, 8) have conceptualized 
doll play aggression in these terms. The doll 
play situation is assumed to be similar enough 
to the home to instigate aggression yet different 
enough to escape some of the inhibitions. 

The doll play setting has certain superficial 
similarities to the home: a doll family, house, 
and furniture. The differences are also impor- 
tant and designed to relax inhibitions against 
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aggression; the people are, after all, dolls, and 
the furniture and rooms are miniatures. Also, 
the experimenter is deliberately friendly and 
nonevaluative. The prediction has been that 
with inhibitions against aggression relaxed in 
doll play, the amount of aggression the child 
shows is a function of the degree of instigation, 
usually measured by how severely the child 
has been punished at home. 

The present study asks how aggression in 
doll play is affected if the doll play setting is 
in some ways made more similar to the home 
by having the child’s mother present at a doll 
play session. The expectation from the fore- 
going line of argument is that aggression should 
be reduced. 


METHOD 


Since doll play usually involves a single adult 
interacting with the child, a control was neces- 
sary to insure that the child’s doll play behavior 
would be attributable to the presence of the 
mother and not to the circumstance that two 
adults were present. Consequently, the control 
group’s doll play was observed by an adult 
female, previously unknown to the child. The 
results from this study may also be compared 
with the several extant sets of norms of doll ` 
play behavior in which the preschool child 
participated only with the adult experimenter 
(3, 6). 

The design, then, involved two groups of 
children, an experimental and a control group: 
For each group there was a control and an 
experimental session. In the first session, both 
groups were treated alike; only the experimen- 
ter was present. In the second session, tbe 
experimental group were observed by theif 
mothers in addition to the experimenter, aP 
the control group by a stranger and t i 
experimenter. 


Subjects 


The Ss were 20 children, 10 boys and 10 girls, aged 
three to five years. They came from upper mi ae 
class, professional homes; 13 of the 20 were enrole 
in the Cornell University Preschool. The children W° r 
first separated into groups of the same sex and @ 
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Within each age-sex group the children were assigned 
randomly to the two treatments. 


Procedure 


The Ss were observed in two sessions of standard, 
experimental doll play, each lasting 20 minutes. There 
Was usually one day intervening between sessions, and 
in no case was the interval longer than three days. 
The doll play procedure has been described in detail 
elsewhere (6). A standard family of dolls was used: 
ets father, boy, girl, and baby. The doll house 
i six uncovered rooms; the furniture was movable, 
Put the walls were fixed. The same adult female (VT) 
Was the experimenter in all sessions. The first session 
volved only the E and the child. 

At the second session, the stranger-present (control) 
Ss entered the experimental room with the Æ to find 
an unfamiliar female adult in the room. This person 
acted as an observer who did not speak either to the 
E or the S, but simply sat watching the session. 

The mother-present Ss entered the second session 
p companied by their mothers and the Æ. The mothers 
had been previously instructed to watch the session 
but not to start interaction with the child. If the 
child talked to them they were to reply in a natural 
manner, 

A The sessions were observed from behind a one-way 
vision screen, The doll play record was unitized at the 
time of the observation into subject-predicate units 
which were recorded verbatim. “The boy runs,” “the 
daddy spanks the girl” are examples of single units. 

scoring convention somewhat different from other 
doll play studies was adopted in that repetitive se- 
quences such as the “boy runs and runs and runs” 
Was scored as one unit. From the protocols two scores 
were derived: (a) total number of units per session 
and (b) per cent aggression per session, computed as 
the number of aggressive units divided by the total 
number of units per session. 

Our attempt to score the agents and objects of the 
aggressive acts was not profitable. Usually, the child 
failed to specify either the agent or the object of the 
act. 
Aggression was defined as any thematic action that 
irritates, hurts, injures, punishes, frustrates, or destroys 
a doll or equipment. It ranged in intensity from in- 
nocuous mischief and the violation of usual household 
Tules (the boy jumps on the bureau) to severe accidents, 
fights, and catastrophes (the father breaks his leg, the 
boy has his head sliced off, a storm breaks up the house). 
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Verbal agression such as scolding and threats were 
included, as were instances in which discomfort was 
attributed to a doll (the boy is sick, sad, lost, etc.). 

Four sessions were observed by two observers. The 
four protocols were combined and agreement computed 
according to the formula: % Agreement = number of 
agreements/number of agreements + number of dis- 
agreements. For total units, the agreement between 
observers was 90%; for aggression units, the agreement 
was 83%. 


RESULTS 


The score computed for the aggression anal- 
ysis is the percentage of total units which are 
aggressive. This score controls for the overall 
level of general verbal thematic productivity of 
the child. For the various analyses, the per- 
centage scores were normalized by use of the 
angular transformation, p = sin’ ¢. 

To ascertain the comparability of the ex- 
perimental and control groups, an analysis of 
variance of the aggression scores in the first 
session, prior to the experimental treatment, 
was performed. The difference in the average 
per cent aggression between the two treatment 
groups was not significant. 

Since the results of the present experiment 
are to be compared with existing doll play 
norms, it is relevant to inquire whether the doll 
play behavior of the present Ss in the initial 
session diverges significantly from that of the 
groups on which norms are available. From a 
study of 202 boys and 177 girls, Levin and 
Sears (3, p. 149) report that the average per 
cent aggression scores in the initial doll play 
session is 14.28 for boys, and 8.64 for girls. The 
mean and .05 fiducial limits for the ten boys in 
this study is 23.10 + 14.14; and for the girls, 
6.90 + 4.98. While the obtained means are not 
as similar as might be desired, these findings 
are compatible with the view that doll play 
performance in this situation is typical of other, 
larger samples of children. 


TABLE 1 
AGGRESSION Scores IN Two Dott Pray SESSIONS 
(N = 5 in each cell) 


Mother Present 


Stranger Present 


Sex Session I Session IL 


Session I Session IT 


Mean* 


SD Mean SD 


Mean SD Mean SD 


Boys |16.40 +: 21.87 


Girs | 6.40 + 13.04 | 10.49 |10.60 Æ 17.49 


17.62 |41.20 - 36.48 | 29.39 129.80 =+ 24.48 | 21.34 |23.60 + 20.7 
14.08 | 7.40 + 2.72 


1 | 16.69 
2.19 | 3.80 + 7.25 | 5.85 


* The mean is followed in each case by fiducial limits at the .05 level. 
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TABLE 2 


SUMMARY OF ANALYSIS OF VARIANCE OF AGGRESSION 
Scores IN Two Dott Pray SESSIONS 


Mean A 
Source df Squares F p 
| 

Treatment 1 11.02 — — 
Sex 1 2666.68 | 9.30 | <.01 
Sessions 1 126.02 | — — 
Treatment X Sex 1 1:77] — — 
Treatment X Ses- 1 | 746.50 | 7.66 | <.02 

sions | | | 
Sex X Sessions 1 191.85 | — — 
Treatment X Sex | 1 34.22 | — — 

X Sessions | 
Error (b) (16) | 286.85 
Error (w) (16) | 97.41 | 


The questions under investigation concern- 
ing the influence of the mother’s presence in 
the second session were approached through 
an analysis of variance of normalized per cent 
aggression scores according to the following clas- 
sifications: treatment, sex of S, and session. 
Treatment, sex, and the interaction between 
the two involve the sums of the aggression 
scores for the two sessions. For these three 
sources of variation, the appropriate error term 
is the variance estimate, among Ss, within sex 
and treatment (labelled “Error b” in Table 2). 
The differences between the scores in the two 
sessions enter into the main effect of “sessions” 
and the various interactions between sessions 
and the other variables. Since the aggression 
scores in the two sessions are correlated, the 
error term for testing the differences between 
sessions is the interaction, Ss X sessions, within 
sex and treatment (labelled “Error w” jn 
Table 2). 

For the aggression scores in both sessions 
combined, boys are more aggressive than girls, 
This finding replicates a number of other 
studies that reported sex differences in doll play 
behavior (1, 3, 5, 6). 


The important results for the present study’ 


are the changes from Session 1 to Session 2. 
As can be seen in Table 2, the session X treat- 
ment interaction is significant at about the .02 
confidence level. Aggressiveness clearly in- 
creases from the first to the second session when 
the mother is watching the child and decreases 
when a stranger is present. Every one of the 
ten experimental children increased in aggres- 
siveness when the mother was present. By a 
sign test, the probability of there being no 
reversals in a series of ten observations is .01, 
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Among the children in the presence of a stran- 
ger, the aggression scores of eight decrease in 
the second session; two children, one boy and 
one girl behaved differently from the majority. 

The results in the experimental and control 
groups may be analyzed separately. Tested 
against a hypothesis of no difference, the in- 
crease of aggression in the group with the 
mother present is statistically significant 
(t = 3.34, p < .01). For the control group, the 
decrease is not significant (¢ = 1.00, p < 30). 

The usual occurrence in doll play with only 
the £ present is that aggression increases in 
the second over the first session. Is the increase 
in the present experimental group different 
from what might be expected in a normal 
second session without mother or stranger 
observing? According to available norms (3; 
p. 149) the mean increase in per cent aggression 
between sessions is 7.56 for boys and 4,12 for 
girls. For the present Ss, the differences 
between sessions and the .05 fiducial limits of 
these differences are 24.80 + 30.37 and 4.20 
£4.91 for the boys and girls in the experi- 
mental group and —6,2)-+-28.04 and —3.60 
£5.18 for the boys and girls in the control 
group. The only group that differs from the 
Scores expected on the basis of previous results 
is that of girls observed by a stranger, who 
appear to be inhibited in their aggressiveness. 


Discussion 


The prediction derived from displacement 
theory that the mother’s presence would result 
in a decrease in her child’s fantasy aggression 
Was opposite to the results actually obtained. 
In light of the present findings, the displace- 
ment model appears to be an over- 
simplification, R 

In the first place, the theory assumes that m 
the parental treatment of aggression any frus- 
tration accumulates a potential for the child 
to be aggressive and any punishment for 28- 
gression. tends to inhibit this behavior. Instig®- 
tion and inhibition with respect to aggressio? 
are viewed as nonspecific, leading under, Ki 
propriate conditions to behavior that 15 
resultant of the instigation and inhibi 
This formulation may be scrutinized both : 
terms of how well it fits the complexity y 
aggression training and as to whether doll Pa o 
as used in the present study, provides gie 
conditions for manifesting the behavior- 
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Press. evidence having to do with young 
et en’s aggression indicates that parents 
ihe pomdan differently depending on the 
a e o the aggressive acts and to whom the 
ana are directed. For example, Sears, 
RT pend and Levin (9) found, in a survey of 
erally ag sy! Sao that mothers tend gen- 
toward a be intolerant of aggressions directed 
Šehiti hemselves, somewhat more tolerant of 
still = ang quarreling among siblings, and 
With rane of their children s fighting 
shar L aymates. These distinctions are often 
See ee for the child to discriminate 
Some aggressions, and to learn that under 
ee ae is not at all con- 
iot E hereas in other situations fighting is 
miy y permitted but even encouraged. It 
children ma even with the mother present, 
ne n had categorized the laboratory situa- 
puke one appropriate to express their im- 
S, especially since their mothers had not 
Unqualifiedly condemned aggression. On the 
other hand, it is not unlikely that one of the 
distinctions taught children vis-à-vis aggres- 
sion is that the beha ‘ior is not appropriate in 
front of strangers. Such training might plaus- 
ibly account for the decrease in aggressiveness 
in the stranger-present group. ` 

One other discrimination that many parents 
Probably reinforced could have influenced the 
behavior of the Ss in the presence of their 
mothers. It is quite unlikely that the children 
had been restrained from make-believe fighting 
or from having dolls and toys fight or be 
mischjevous. Consequently, fantasy aggression 
may be considered by many children as ap- 
proved regardless of whether or not their 
mothers are watching. 

The above afterthoughts do not completely 
invalidate the displacement theory of fantasy 
Aggression, but they argue for greater refine- 
ment than the theory now possesses. It would 
seem that the prediction of aggression in doll 
play could be improved if the fantasy situation 
were analyzed as to its similarity to approved 
or disapproved conditions of aggression. As we 
stated earlier, it is unfortunate that in the 
present study the agents and objects of aggres- 
sive acts were not specified frequently enough 
to warrant their separate analysis. _ 

Two other hypotheses may be advanced to 
explain the fact that the presence of the mother 


did not restrain the children’s aggressiveness. 
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One involves the presumption that the like- 
lihood of retaliation for aggressive actions is a 
powerful inhibitor of aggression. It is soon 
clear to the child that there is no retribution in 
doll play, since the mother is in an obviously 
immobilized position. We may speculate that 
had the mothers been able to discipline their 
children, the amount of overt aggression would 
have been reduced rather than increased. 

A second conjecture about the increase of 
aggressiveness in the mother-present group 
arises from the nature of superego development 
in the child? Self control is a worrisome burden 
for the child so that he is ready to transfer his 
newly acquired control to his mother when she 
is available. Anecdotally, we know that 
mothers often complain that their children are 
so well behaved only when they are not around. 
If we assume that returning impulse control to 
the mother is most likely for children with the 
strongest superegos, we may indirectly test 
this “transfer of superego” hypothesis with the 
present data. 

There is some evidence that, at the preschool 
ages, girls have more strongly developed super- 
egos than do boys (9),° so that we might expect 
girls to be more influenced by the mother’s 
presence than are boys. The sex difference an- 
alysis gives little support to this hypothesis. As 
can be seen in Table 2, girls are no differently 
influenced by the experimental treatment than 
are the boys. 


SUMMARY 


Twenty children were observed in two ses- 
sions of doll play. The first session involved 
the child and an experimenter. During the 
second session, the child’s mother was an 
audience for one group and an adult female, 
not previously known to the child, watched the 
sessions of the second group. 

As is usual in doll play, boys are more aggres- 
sive than girls. Each of the ten children ob- 
served by their mothers were more aggres- 
sive in the second than in the first session. 


2 We are indebted to Dr. John W. M. Whiting for 
suggesting this interpretation of the findings. 

3 Questionnaires to the mothers of the children in 
this study asked the same questions as (9) about signs 
of superego development: confessions of wrongdoing, 
looking sheepish or guilty and admitting transgressions 
when asked. These data confirm the earlier results, 
indicating that girls show more signs of superego than 
do boys. 
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Eight of the ten children watched by a stranger 
decreased their aggressions in the second ses- 
sion. The findings are discussed in terms of 
modifications of the displacement theory of 
aggression. 
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VERBAL CONDITIONING AS A FUNCTION OF 
EXPERIMENTER CHARACTERISTICS 


ARNOLD BINDER, DAVID McCONNELL’, anp NANCY A. SJOHOLM 


Indiana University 


NUMBER of experiments have confirmed 
Greenspoon’s (5, 6) findings that 
various stimuli can serve as reinforce- 

ments for classes of verbal responses. It has 
thus far been reasonably convincingly demon- 
Strated that such categories as “plural nouns” 
(5, 6), “nonplural words” (5, 6), “animals” 
(2), “statements of attitude toward an aca- 
demic issue” (8), and “first person pronouns” 
(4, 7, 11) form response classes, and that such 
experimenter responses as “Mmm-hmm” (2, 
>, 6), “Good” (4, 7, 8, 11), and “expressions of 
agreement” (12) are effective reinforcements 
th leading to an increase in the frequency of 
Subjects? responses falling in certain of these 
Categories, It is especially significant that, with 
few exceptions, the subjects had been unable 
to verbalize the relationship existing between 
their own responses and the reactions of the 
€xperimenter (so-called “unconscious learning” 
or “learning without awareness”). 

Modern day interviewers of many differing 
Schools of thought stress the importance of the 
Mterpersonal aspects of the interviewing 
Process. According to this viewpoint, the 
interviewer (whether psychotherapeutic or 
Otherwise) is an important variable to consider, 
Since the responses of the interviewee are in- 
fluenced by that particular interviewer's 
Physical and behavioral characteristics. The 
changes in interviewee’s expressions of opinion 
concomitant with alterations in the character- 
istics of the interviewer have been shown in 
the studies reported by Cantril (3) and by 
Robinson and Rohde (9). Sarason (10) has 
Perhaps been most articulate in his advocacy 
that any complete analysis of a clinical or 
experimental situation involving a subject and 
an experimenter (or examiner or interviewer) 
must include consideration of the manner in 
which the social and personal characteristics 
of the latter influence the behavior of the 
former. 

The purpose of the present experiment was 
to study the effect of experimenter differences 
“pon performance in a verbal conditioning 


1 Now at Ohio State University. 


situation. The question at issue was: Would 
two experimenters differing markedly in 
physical and social characteristics produce 
different rates of learning in randomly selected 
subjects in an experiment similar to those in 
which minimum experimenter response has 
produced changes in the verbal behavior of a 
subject without his awareness? This experiment 
can, of course, only be considered preliminary 
since positive results would indicate the need 
for further research aimed at finding the specific 
effects of varying experimenter variables upon 
dependent learning variables. 

The class of subject responses reinforced for 
this experiment was the use of “mildly hostile” 
words in sentences made up by the subject, and 
the reinforcement selected was the response 
“Good.” The hostile word class was chosen 
because of the interesting incidental, interview- 
relevant comparison that might thereby be 
facilitated. 


METHOD 


Subjects 


Two groups of Ss were run for the study, both 
selected by means of routing slips voluntarily signed 
in a few introductory psychology courses at Indiana 
University. The two groups consisted of 29 and 30 
Ss, respectively. 


Experimenters 


The first group (29 Ss) was run by an attractive, 
soft-spoken, reserved, young lady who was 5'34” in 
height and 90 pounds in weight. The experimenter for 
the second group (30 Ss) was very masculine, 6'5” 
tall, 220 pounds in weight, and had many of the un- 
restrained personality characteristics which might be 
expected of a former marine captain. Both were stu- 
dents at Indiana University. Perhaps more important 
than their actual age difference of about 12 years was 
the difference in their age appearance; the young lady 
could have passed for a high school sophomore while 
the male experimenter was often mistaken for a faculty 
member. Henceforth, for the sake of convenience, we 
shall refer to the former experimenter as FE and the 
latter as ME. 


Materials and Apparatus 


Table 1 shows the two sets of words used in this 
experiment, one set consisting of “mildly hostile” 
verbs and the other of “neutral” verbs, all in the past 
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I 
described above. There was a sample card in the slotted 
holder as each S sat down in the experimental room, 
with the verbs “walked” and “ran” typed in the lower 
portion and the pronouns “he,” “they,” and “she” in 
the upper portion. The following instructions were 


then read: ? 


you can easily read the words on the card before 
you. I am going to put other cards before you one 
by one. Each of these will be like the one before 
you with five words on it, three words in the upper 
part of the card and two in the lower part. For 
each card, I would like you to make up a sentence 
using one of the three upper words as the first word 
in the sentence and one of the lower two words 
as the second word in the sentence. You may use 
any one of the three upper words and either one of 
the two lower words in your sentence. 

I will place a new card before you each time you 
make up a sentence using the words of the preced- 
ing card. Now, it doesn’t matter whether the sen- 
tence you make up is long or short, or even if it’s 


Sit in front of this stand in such a manner that | 
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TABLE 1 
VERBS PRESENTED TO SUBJECTS 

Neutral Hostile 
scribbled harmed 
snored raged 
saluted sneered 
paved squabbled 
heard vexed 
lost blamed 
bounced criticized 
charted bickered 
assigned aggravated 
shrugged punished 
paraded irritated 
recited resented 
enclosed hissed 
reclined quarreled 
misplaced annoyed 
enlarged jeered 
compiled pestered 
subtracted disliked 
emerged scolded 
regulated scowled 


tense. The words were selected from larger lists 
generously supplied to the experimenters by Dr. 
Arnold Buss. Buss had six judges sort a large list of 
verbs into one of three categories: “neutral,” “mildly 
hostile,” and “intensely hostile.” A word was accepted 
in a particular class only if five of the six judges agreed 
on its category. No “intensely hostile?” words were 
used in the present study. 

Seven sets of 20 pairs, each pair consisting of one 
hostile word and one neutral word, were obtained by a 
process of random matching. The matching of hostile 
words with neutral words for each set of pairs was 
done without replacement so that the 20 pairs in each 
set contained every word. Each of the pairs was typed 
in capital letters on a 3” x 5” white card. The pairs 
were placed about 114” from the bottom of each card, 
both words on a single line, and roughly centered with 
a blank space between them. The right or left position 
of hostile and neutral words was randomized. 

About 34" above the paired verbs on each card, 
again in a single line of type, were placed the three 
Pronouns “she,” “he,” and “they” in capital letters. 
The order of pronouns on each card was randomly 
determined, although the same three were used on ail 
cards. 

The presentation apparatus consisted simply of a 
front board (10}4” in height by 1814” in width) and 
two side boards (1014" x 19”) attached to the front 
board and extending back to the left and right of the 
e a e of these boards were to 
a slotted holder for th 3” 5 5” cards tee 

c e 3” x 5” cards on the upper 
center portion of the front board. The apparatus was 
placed on a table of such height that even the shortest 
S could see the head of the shorter experimenter, FE. 


Procedure 


Before starting the trials of 
menter shuffled each of the sey 
were before him, but conceale 


each new S, the experi- 
en stacks of cards which 
d from S by the boards 


simple or complex. It is important that you respond 
promptly with the first sentence that occurs to you. 
Let me stress that I would like you to respond with 
the sentence which occurs to you immediately after 
the exposure of the word card without further anal- 
ysis or selection. p 
After reading the instructions and answering any 
questions by rereading the relevant portions of the 
instructions, the experimenter presented the first 
stack card which happened to come out on top as a 
result of the preliminary shuffling. In order to find i 
operant response level, the experimenter provided no 
reinforcement for S responses to either this initial | 
card or any of the subsequent cards in the first stack. 
After completing the first stack in this fashion, the 


experimenter presented the first card of the second | 


e 


stack without any delay between stacks or any indica- 
tion to S that he had begun another stack. If the 
sentence made up by S in response to this card included 
the hostile verb rather than the neutral verb (regard- 
less of the particular pronoun used), the experimenter 
responded, “Good.” The experimenter said nothing 
if S used the neutral verb. The experimenter then 
proceeded through the remaining 19 cards in the second 
stack, reinforcing the use of hostile words with he 

and not responding to the use of neutral words. The 
remaining stacks were presented in the same way 
without pause or change in pattern when pune 
from the last card of one stack to the first card of th 

next stack. 

At the conclusion of the above procedure, “> f 
was asked questions to determine whether or no os 
had insight into the relationship between his ae i 
and the reactions of the experimenter. Great care Y ut 
taken to insure that both experimenters carried 0 
identical procedures and reinforcement patterns. 


RESULTS 


: k 
Each set of 20 cards was considered a pa ' 
for analysis purposes, giving a total of wait 
blocks of trials for each S. The data for ea¢ 


: : ving 
consisted of the number of sentences invol 


VERBAL CONDITIONING AND THE EXPERIMENTER 


TABLE 2 
Mean Number or Hoste Worps Usep 
PER BLOCK By ALL SUBJECTS OF THE 
Two EXPERIMENTS 


Experi- | No. of Block 
menter | Subjects T 
i{2|/3|4]/s|o6|7 
| | | a 
FE 29 8.89.1 10.3 10.711.211.411.6 
ME 30 9.49.8 10.110.610.9 11.1 11.2 


the use of hostile words given in each of the 
Seven blocks. Table 2 contains the mean num- 
ber of hostile words used in each block for the 
Ss run by FE and ME, respectively. 

Before comparing the performance of the 
two sets of Ss, it was desirable to establish 
whether or not any learning occurred for the 
two groups taken separately. For this deter- 
mination, one of the tests for trend recom- 
mended by Alexander (1), was used. The test 
used assumes a block-to-block fluctuation 
Pattern common to all Ss but requires no 
assumption as to the equivalence of individual 
Slopes. Table 3 summarizes the procedures 
applied to the data of each experimenter to test 
the hypothesis of no consistent trial-to-trial 
fluctuation or, in other words, of plateau 
Performance throughout. 

The Fs derived from group slope are sig- 
nificant beyond the .001 level and indicate that 
the two groups tend to increase in a linear 
fashion. The insignificant Fs derived from 
group deviations from linearity indicate that 
neither group shows consistent fluctuation 
apart from the linear trend. 

But the question immediately arises: How 
many of the Ss could verbalize the relationship 
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between their behavior and the experimenter’s 
responses, and would there be any consistent 
learning trend if only those Ss who showed no 
insight were included? Actually, seven of the 
Ss in FE’s group and five of the Ss in ME’s 
group demonstrated insight by verbalizing the 
contingency. 

Table 4 contains the results of the same 
trend test as described above as applied only 
to those Ss who had no insight. There were, of 
course, 22 such Ss in the group of FE and 25 
in 1 E’s group. Again the F for group slope for 
the Ss of FE is significant beyond the .001 
level. However, the corresponding F for the Ss 
of ME just misses being significant at the .05 
level (an F of 3.92 is necessary). The Fs derived 
from group deviations from linearity are once 
again not significant, indicating no consistent 
fluctuation other than linear trend. These 
results, together with the finding (reported 
below) of a highly significant over-all slope, 
support the hypothesis that the conditions of 
this experiment brought about the unverbal- 
ized learning to use hostile words rather than 
neutral ones in the structure of free sentences. 

The primary hypothesis that the learning 
rates of Ss in verbal conditioning are a function 
of the personal characteristics of the experi- 
menters, even though the participation of the 
experimenters is minimal and Ss are not aware 
of the experimenter response contigency, was 
tested by an extension of Alexander’s test for 
trend. The extension tests the reasonableness of 
concluding that two groups differ in slope. 
Only those Ss who did not demonstrate insight 
in the questioning at the end of the experiment 
were used in this analysis. The results may be 


TABLE 3 


SUMMARY OF TREND TESTS FOR THE Two FULL Groups or SUBJECTS TAKEN SEPARATELY 


Experimenter FE 


Experimenter ME 


Source SS df MS F SS df MS F 

Individual deviations from 807 140 5.76 519 145 3.58 

estimation (error) 
Group deviations from 14.0 5 2.80 49 3.00 5 -60 17 

linearity 

etween individual slopes 298 28 10.64 384 29 13.24 

Troup slope 200 1 200 34.72* 82.0 1 82.0 22.90* 

etween individual means 2493 28 89.04 1214 29 41.86 

Total 3812 202 2202 209 


* b < 001. 
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TABLE 4 


SUMMARY OF SEPARATE TREND TESTS FOR THE Two Groups or Supjects Snowrnc No Insicut 


Experimenter FE 


Experimenter ME 


Source ss df MS F SS df MS F 

Individual deviations from 548 105 5.22 427 120 3.56 

estimation (error) 
Group deviations from 7.0 5 1.4 .27 1.0 5 -20 -06 

linearity 
Between individual slopes 144 21 6.86 151 24 6.29 
Group slope 113 1 113 21.65* 13.0 1 13.0 3.65 
Between individual means 331 21 15.76 529 24 22.04 

Total 1143 153 1121 174 
*p < .001. 


seen in Table 5. The F coming from the be- 
tween group slopes is significant beyond the .01 
level, in support of the hypothesis that the two 
groups differ in learning trend. The significant 
F derived from between group means (p< .05) 
would lead us to reject the hypothesis that the 
two groups do not differ in average number of 
hostile words. Fig. 1 shows the learning curves 
of these groups over the seven blocks. 

The very highly significant over-all slope 
(p < .001) requires rejection of the null 
hypothesis that Ss do not show unverbalized 
learning to use hostile words under the condi- 
tions of this experiment. 

The availability of Ss at the time of the 
experiment did not allow us to equate the two 
groups for numbers of males and females. Not 
only did FE start out with more males than 
ME, but five of the seven Ss eliminated (for 


TABLE 5 


SUMMARY or THE TEST COMPARING THE 
TRENDS OF THE Two Groups SHOWING 
No Insicnr 


Source SS |df| MS F 

Individual deviations from | 977 |225 4.34 

estimation (error) 
Between individual slopes | 293 | 45) 6.51 
Between individual means | 860 45/19.11 
Group deviations from es- 5.0) 5) 1.0 

timation 
Between group slopes 30.3} 1130.3 | 6.98+* 
Between group means 19.0) 119.0 | 4.38* 
Over-all deviations from 5.0} 5| 1.0 28 

linearity 
Over-all slope 93.6] 193.6 |21.57*** 

Total 2282 9/328) 

*p <05. 

AN 

+p < 001. 


reasons of insight) from her group were fe- 
males. Of the five eliminated from M E’s group, 
three were male and two were female. Thus, 
for the test of difference in group slope, there 
were six females and 16 males in the FÆ’s 
group, and 15 females and 10 males in ME’s 
group. In order to justify any conclusions 
regarding the effect of experimenter character- 
istics upon learning, then, we must show that 
the differences found between the two groups — 
could not be accounted for purely on the basis 
of the differences in sex composition. 
Accordingly, the entire array of Ss without 
insight was divided into two groups on the 
basis of sex and irrespective of the particular 
experimenter used to get their data. Table 6 
presents a summary of the Alexander test for 
differences in trend as applied to the compari- 
son between female and male groups. The fact 
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TABLE 6 
Summary OF THE TEST COMPARING FEMALE 


AND MALE SUBJECTS WITHOUT INSIGHT 


Source SS |df| MS F 
a DENN i a i 
Individual deviations from | 958 225! 4.26 
estimation (error) | | 
etween individual slopes | 319 | 45 7.09) 
Between individual means | 877 | 45 19.49 
Troup deviations from es- | 23.0) 5! 4.6 
timation | 
Between group slopes 3.9} 1| 3.9 .92 
Between group means 2.0} 1j 2.0 47 
Over-all deviations from 5.0) 5| 1.0 23 
linearity | 
Over-all slope 93.6; 193.6 | 21.97* 
| 
Total 2281 -5828 
SARN E | 
*p <00. 
TABLE 7 
Meran Number or Hoste Worps Usep 
PER BLOCK BY MALE AND FEMALE 
a SUBJECTS WITHOUT INSIGHT 
Sex M | No: of Block 
j Subjects | 
IGUGUN 
a Bd | | | 
Male 26 18.79.1| 9.610.5.10.010.610.5 
Female 21 19.39.710.3) 9.610.1 10.1)11.0 
i ao) 


that neither the difference between group slopes 
nor the difference between group means led to 
Significant Fs supports the hypothesis that 
males and females do not differ in learning rate 
or over-all level of usage of hostile words in 
this particular context. The mean values for 
the two groups may be seen in Table 7. 


Discussion 


The principal hypothesis of this study was 
that learning rate in a verbal conditioning 
Situation is a function of experimenter char- 
acteristics even though the participation of the 
experimenter consists only of making a single 
word reinforcing response and the S is unaware 
of the reinforcing contingency. This hypothesis 
Was supported by the experimental results. 
However, the direction of the difference in 
group slope is quite interesting. As may be seen 
in Fig. 1, the Ss in FE’s group have a lower 
Mean number of hostile words for the first 
(operant) block but a higher mean number for 
the last block as compared with the Ss in ME’s 
8toup. It surely would not have been easy to 
Predict that the Ss of FE would show a sig- 
“ficantly greater learning rate or slope than 
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the Ss of ME. Preliminary conjecture, in fact, 
found it reasonable to expect that the response 
“Good” would have more of a reinforcing effect 
when uttered by the big, prestigeful, male 
experimenter (ME) than by the slightly built, 
soft-spoken, female experimenter (FZ). On an 
a posteriori basis, however, one may speculate 
that FE provided a less threatening environ- 
ment, and the Ss consequently were less in- 
hibited in the tendency to increase their fre- 
quency of usage of hostile words. 

The finding that the experimenter response 
of “Good” led to a significant trend of greater 
usage of hostile words in sentences is of some 
interest in itself, as it had not been demon- 
strated before. Buss and Durkee? carried out 
exploratory research on the verbal conditioning 
of hostile words. They presented their Ss with 
cards containing a “neutral,” a “mildly hostile” 
and an “intensely hostile” verb under in- 
structions to make up a sentence using one of 
the three words on each newly presented card. 
These investigators found that little or no 
conditioning was achieved when the experi- 
menter said “Good” after correct responses and 
nothing after incorrect responses, although 
conditioning did occur when “Right” and 
“Wrong” were used as reinforcers. Buss and 
Durkee, moreover, reinforced the intensely 
hostile words rather than the mildly hostile 
words in order to maximize the chances for 
learning. 

There were certain differences in procedure 
between the Buss and Durkee experiment and 
the present one which might account for the 
differing findings on the reinforcing effect of 
“Good.” But perhaps more important than 
these is the fact that different experimenters 
were involved in the two investigations—a 
factor which this experiment has shown to in- 
volve significant effects. 

Some of the other differences reported in the 
literature on verbal conditioning may be 
accounted for on the basis of experimenter 
variability. Verplanck (12), for example, re- 
ported that some of his student experimenters 
obtained learning effects in their Ss while other 
experimenters obtained essentially plateau 
performance. 

While it is true that the differences in 
learning effects found in this experiment were 


2 Buss, Arnold H., & Durkee, Ann. Personal com- 
munication. November 21, 1956. 
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concomitants of markedly different physical 
and temperamental characteristics of the 
experimenters, such variability is not at all 
uncommon in psychological experimentation. 
Moreover, it would not be unreasonable to 
expect that greater differential effects would be 
obtained in experiments where the extent of 
experimenter participation is greater than in 
the present experiment. Future research could 
profitably be directed toward determining the 
specific personality and/or physical character- 
istics of experimenters which lead to differ- 
ential learning effects, the types of learning 
situations in which such effects occur, and the 
constructs from learning and personality 
theory which can account for the phenomena. 


SUMMARY 


The purpose of this investigation was to test 
the hypothesis that differences in the char- 
acteristics of experimenters can lead to differ- 
ential learning effects when the experimental 
session involves verbal conditioning without 
awareness. Two experimenters of different sex 
and markedly different height, weight, age 
appearance, and personality ran separate 
groups of Ss. The Ss were presented with 3” x 
5” cards which contained the pronouns “he,” 
“she,” and “they” in random order and two 
past tense verbs, a “neutral” one and a 
“mildly hostile” one. The task of each S was 
to make up a sentence starting with one of the 
pronouns and utilizing one of the two verbs. A 
total of 20 neutral and 20 hostile verbs was 
used, arranged in seven blocks with random 
pairings of neutral and hostile words within the 
blocks. The experimenters said “Good” 
whenever an S used the hostile verb rather 
than the neutral one in his freely constructed 
sentence. 

The female experimenter’s group totaled 29 
Ss, but seven of these were eliminated from the 
final analysis because they could verbalize the 
relationship between their behavior and the 
experimenter’s response. The corresponding 
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numbers for the male experimenter’s group were 
30 and five. 

The data were analyzed by means of trend 
tests recommended by Alexander (1). The 
results showed that the response “Good” was 
reinforcing for the class of behavior consisting 
of the use of hostile words in sentences. In 
addition, it was found that the rates of learning 
for the Ss of the two experimenters differed 
significantly, with a steeper slope for the female 
experimenter’s group. 
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MEDIATED GENERALIZATION AND THE INCUBATION EFFECT 
AS A FUNCTION OF MANIFEST ANXIETY: 


MARTHA T. MEDNICK? 


Northwestern University 


NE OF the earliest and most widely 
cited studies attempting to verify 
clinically observed phenomena in the 

experimental laboratory was reported by 
Diven (6). In order to test some notions about 
the nature of anxiety, he conditioned a psycho- 
galvanic response (PGR) to the word “barn” 
and measured autonomic reactivity to other 
rural words. Responsivity to the rural words 
Was found to be greater than to a group of 
Nonrural words, and this was considered to be 
evidence for the generalization of anxiety. In 
addition, conditioned autonomic activity 
tended to increase as a direct function of the 
length of a rest pause preceding extinction. 
This was called the “incubation effect.’’ White, 
Cameron, and others (20, 4, 11), in their dis- 
Cussion of anxiety, deal with these aspects of 
the Diven study at length. 

The Diven findings on verbal generalization 
may be interpreted in terms of recent attempts 
to explain clinical phenomena on the basis of 
learning theory. The study may be regarded as 
an instance of mediated generalization (MG), 
a concept discussed by Cofer and Foley (5), 
which may be illustrated, using Diven’s verbal 
Stimuli, as follows: 

Barn ++ shock elevated PGR 
Bay oc ava Sains we A ROMA SO OE WERE elevated PGR 
Cow (a rural word)...Barn............ elevated PGR 


“Barn,” the implicit common or mediating 
term elicited by “cow,” elicits an elevated 


PGR. 

MG has been used in attempts to under- 
Stand the role of covert symbolic processes in 
the determination of behavior. Dollard and 
Miller (7) placed considerable emphasis on this 
Process in their discussion of personality de- 


1 This paper is based upon a dissertation submitted 

to the Department of Psychology of Northwestern 
University in partial fulfillment of the requirements 
for the Ph.D. degree. The writer wishes to thank Prof. 
anet A. Taylor, dissertation sponsor, for her advice 
and criticism. Mr. Bernard Bernstein gave much needed 
echnical assistance during the pilot stages of this 
tady, This paper was written while author was a 
SPHS Post-doctoral Fellow, Harvard Medical School. 

* Now at 63 Madison Rd., Waltham, Massachusetts. 


velopment in terms of learning theory. These 
writers also used the concept of anxiety incor- 
porated into the Hullian system by assuming 
that an anxiety response has drive properties 
and contributes to the total drive state of an 
organism. Some verification of this hypothesis 
concerning the drive properties of anxiety has 
been achieved through studies of the perform- 
ance on learning tasks of human Ss separated 
on the basis of response defined anxiety by 
means of extreme scores on the Taylor Mani- 
fest Anxiety Scale (MAS) (17). The results of 
a number of studies have supported predictions 
based on the assumption that MAS scores are 
related to drive level (18). Dollard and Miller 
argued further that generalization varies di- 
rectly with drive and, hence, with anxiety. 
While stimulus generalization (generalization 
along a physical continuum of similarity) has 
been shown to vary with primary drive in ani- 
mal studies (3, 12) and with anxiety (13), such 
a relationship has not been clearly demon- 
strated with MG. Lacey (9), replicating 
Diven’s study and results, found that generali- 
zation tended to vary directly with MAS 
scores, but the group differences did not reach 
significance. The Lacey study must be inter- 
preted with caution since the conditioning 
procedures utilized were somewhat irregular; 
the CS-UCS interval was 15 sec., filled with S 
associating aloud to the CS. This method was 
utilized in the interest of maintaining S’s un- 
awareness of the CS-UCS contingencies. 

The incubation effect described above was 
the other major finding reported by Diven. 
Bindra and Cameron (1) also studied this 
effect during the presentation of noxious stimu- 
lation rather than under conditioning and 
extinction paradigm. These investigators con- 
cluded that there appeared to be a genuine 
incubation effect but that little is known of 
its exact nature or its relation to other var- 
lables. 

The present study was an attempt to repro- 
duce Diven’s and Lacey’s findings with respect 
to the generalization and incubation of a con- 
ditioned autonomic response and to examine 
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the relationship of mediated generalization to 
manifest anxiety. An emphasis was placed on 
an empirical choice of generalization stimuli. 
It should be noted that generalization words 
were chosen by Diven and Lacey on a rational 
basis. It was felt that additional work in this 
area would profit by a pre-experimental dem- 
onstration of the associational relationship of 
the generalization stimuli to the CS by making 
it possible to order the stimuli along a rough 
continuum and to attempt to obtain a gradient 


of MG. 
METHOD 


Subjects 

Two groups of 45 Ss each were chosen from intro- 
ductory psychology classes at Northwestern University 
on the basis of extreme scores on the Heineman form 
of the MAS (8). Forty-five individuals (15 males and 
30 females) receiving scores of 64 and above on the 
scale were included in the high anxious group (HA), 
and the 45 Ss scoring 49 or below (35 males and 10 
females) were designated as the low anxious group 
(LA). In addition, three LA and four HA Ss were 
tested, but because they showed no evidence of con- 
ditioning, they were excluded from further considera- 
tion. Ss were assigned to the various conditions by a 
noninvolved person so that Æ had no knowledge of 
MAS scores during the experiment. 


Apparatus 


Galvanic measures were made from a standard 
Stoelting polygraph. Basal readings, taken directly 
from a dial on the instrument, were recorded period- 
ically. At random intervals during the presentation of 
the experimental stimuli, an artificial resistance of 
1000 ohms was thrown into the machine. This provided 
a standard measuring unit for each record. The verbal 
stimulus materials were presented to S by means of a 
tape recorder via earphones. A loud raucous noise 
which served as the UCS was also presented by means 
of the tape recorder. A measure of the intensity of the 
test stimuli revealed no differences between stimuli. 


Verbal Stimulus Materials 


In constructing the stimulus materials, the results 
of a recent study by Russell and Jenkins (15) were 
utilized. These investigators administered the 100 
words from the Kent-Rosanoff Word Association Test 
to 1,007 college students. A single association was 
elicited to each of the words and the frequency of each 
association determined. For the present study, the 
word that was chosen as the CS appeared (in varying 
degrees of frequency of association) as a response to a 
number of different Kent-Rosanoff stimulus words. 
Thus, the words “dark,” “heavy,” “soft,” and “lamp” 
all had as a common response the word “Tight.” “Light” 
was therefore designated as the CS in the present study 
and the words to which it had been associated as the 
generalization test words. 
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TABLE 1 
PERCENTAGE OF INDIVIDUALS ASSOCIATING 
THE WORD “LIGHT” To GENERALIZATION 


Worps 
Generalization Words 
Sample v = 
Dark Lamp Heavy Soft 
Minnesota 1007 83 63 58 86 
Northwestern 174 66 59 47 mil 
Minnesota* 
HA 28 86 50 54 0 
LA 28 82 61 57 0 
Northwestern 
HA 25 60 60 32 4 
LA 25 o4 60 60 0 


* Personal communication from Dr. Shirley Jahnson, Uni- 
versity of Minnesota. 


Table 1 presents the percentage of individuals in 
the Minnesota group who responded to each generaliza- 
tion test word with the word “light.” A list of 40 of 
the 100 Kent-Rosanoft stimulus words containing all 
of the test words was also administered to a sample of 
Northwestern University students; the percentage of 
individuals responding with the word “light” to the 
test words in this group can also be seen in Table 1. 
Since it is important to show that HA and LA individ- 
uals do not differ from each other with respect to these 
associational frequencies, the relevant data are in- 
cluded in Table 1. No important differences were 
found between these two groups. 

The stimulus materials used during the conditioning 
trials consisted of the CS and a list of 24 neutral words 
(a neutral word being one to which the CS was not 
given as an association by the Minnesota group). Each 
of the neutral words was repeated three times in dif- 
fering orders, while the critical word, “light,” was 
presented on 14 occasions. Thus, the training series 
consisted of 72 presentations of neutral words and 14 
randomly spaced presentations of the CS. The Taucous 
buzzer followed the CS on 9 of the 14 occasions it 
was presented. This partial reinforcement procedure 
was followed for two reasons: as an attempt to prolong 
extinction, and to allow the trials on which the CS was 
presented without the UCS to serve as a test for con- 
ditioning. The test stimulus list, used for measure 
of generalization and extinction, consisted of the CS, 
the four generalization stimuli, and three a 
stimuli. This series was repeated eight times with ite 
order of the four generalization words and one of a 
neutral words (“square”) being counterbalanced ea 
to avoid positional effects. The remaining two ot a 
words were included so as to increase the num ‘time 
words heard by S between the other words. aoe rhe 
between the presentation of each word in bot ah 
conditioning and test series varied from 8 to 12 to 
thus allowing some time for return of S’s response 
basal level. 


Conditioning Trials 

onditioning 
se 
fro™ 


The Ss in all groups received the same co! 
procedure. The experiment was conducted ae 
soundproof room that offered relative freedo! 
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distraction. The earphones effectively eliminated other 
distracting noises. Each S was seated in a comfortable 
broad-armed easy chair with electrodes and earphones 
attached to him. Initial instructions, designed to 
acquaint S with the nature of the apparatus, were 
given by Æ. A cloth screen shielded the apparatus from 
S's view. Additional instructions informing S about 
the stimuli to be presented were on tape and heard by 
S over the earphones, These indicated that a buzzer 
and words were to be presented at various intervals 
and that S’s task was merely to remain quiet and listen 
Since his physiological reactions were being studied. 
No mention was made of the relationship of the buzzer 
to the appearance of any specific word. 

Following the instructions, 15 “adaptation” words 
Were presented to allow S to adjust to the apparatus 
before conditioning was attempted. These adaptation 
words were immediately followed by the conditioning 
Series, 


Test-Extinction Trials 


Each of the two anxiety groups was divided at 
random into three subgroups for the extinction pro- 
cedure, One subgroup (Condition I) underwent a regular 
extinction procedure immediately following the con- 
ditioning series. There was no interruption between 
training and extinction, no additional instructions 
Were given, and Ss remained attached to the apparatus. 
The second subgroup (Condition II) had the electrodes 
and the earphones removed with instructions from 
E that they would be allowed to rest; S remained in 
the chair with Æ seated out of sight behind the screen. 
After a lapse of 10 min., S was reattached to the ap- 
Paratus for the extinction procedure. The Ss were 
readapted to the point where deflections had mainly 
ceased before the tape recording was again turned on. 
The third subgroup of Ss (Condition II) was allowed 
to leave following training and instructed to return 
exactly 24 hrs. later for the remainder of the experi- 
ment. Prior to his final dismissal, each S was asked to 
report whether he knew which word had been followed 


by the buzzer. 
RESULTS 


Raw ohms resistance data were obtained 
from the records by measuring the heights of 
galvanic deflections, using the 1000 ohm arti- 
ficial deflection as the measuring unit. Follow- 
ing the method suggested by Lacey and Segal 
(10), ohms resistance was then transformed 
into conductance units, micromhos (M), by 
use of the formula, M = 10° (1/r). Due to the 
skewed nature of the data, median measures 
and appropriate nonparametric statistics were 
used throughout the analyses. In cases where 
analysis of variance by ranks was used, the 
obtained chi-square values were converted to 
F values (19). 

_ As noted above, there were unequal propor- 
tions of males and females in the two anxiety 
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CONDITIONING 


[EXTINCTION e-o HA GROUP 
e—e—— LA GROUP 


8 


6 


8 


TEST TRIALS, 

Fic. 1. MEDIAN AMPLITUDE oF CONDITIONED RE- 
SPONSES AT Eacn Test TRIAL DURING CONDI- 
TIONING AND DURING IMMEDIATE EXTINCTION 
(Extinction Data For CONDITION I ONLY) 


groups so that the anxiety and sex variables 
were confounded. In order to evaluate the pos- 
sible effects of sex on conditioning, subgroups 
of 23 males and 23 females equated for MAS 
scores were compared as to the amount of 
conditioning demonstrated. The differences 
were not significant (chi square = 1.8). There 
was thus no evidence that sex was a variable 
influencing conditioning in this study. 

Another preliminary analysis was made in 
order to see whether the three incubation 
groups were equally conditioned before the 
commencement of extinction. An analysis of 
variance by ranks demonstrated that these 
groups did not differ significantly (F = .848). 

It should also be noted that of the 90 Ss, 89 
were able to state that “light” had been the 
word followed by the buzzer. 


Conditioning 


The responsivity to the five conditioning test 
trials (those occasions on which the CS was 
not accompanied by the buzzer) represented a 
measure of conditioning. As can be seen from 
Fig. 1, the HA group responded with greater 
amplitude on all test trials than did the LA 
group. A total conditioning score for each S 
was obtained by summing the amplitudes of 
all five of his conditioned responses (CRs) 
during training. The differences between 
groups in total scores, evaluated by means of 
chi square, were significant (chi square = 4.8, 
p < .05 for 1 df). 


Generalization 


The tests for generalization were based on 
the amplitude data of the first extinction series 
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C20 


HIGH ANXIOUS 
o— LOW ANXIOUS 


CONDUCTANCE IN MICROMHOS 


O THT 


(CS) 


SQUARE 
(NEUTRAL) 


LAMP HEAVY SOFT 
GENERALIZATION WORDS 
Fic. 2. MEDIAN AMPLITUDE OF RESPONSE TO EACH 

Worp ror HA anp LA Ss 


DARK 


TABLE 2 


CHI-SQUARE ANALYSES BETWEEN Worps 
FOR ALL Ss COMBINED 


(All at 1 df) 7 
Word Pair RP $ 
Light-Lamp 17.28 <.001 
Lamp-Dark 5.28 <.05 
Lamp-Heavy 2.78 n.s. 
Lamp-Soft 2:37 n.s. 
Dark-Square 4.65 <.05 
Heavy-Square 10.32 <.01 
Soft-Square 12.00 <.01 


because of the great increase in the number of 
response failures in the later series. The word 
“square” was used as the neutral stimulus in 
all comparisons since the order of presentation 
of this neutral word was counterbalanced. As 
may be seen in Fig. 2, the median amplitude 
for the test words tended to be higher than 
for the neutral word in both groups. With re- 
Spect to the degree to which the test words 
were associated with the CS “light,” the most 
strongly associated word “dark” elicited less 
responsivity than would be predicted on the 
basis of association alone. The remaining three 
stimuli roughly followed the expected pattern, 
with “lamp” having the greatest reactivity. 
In order to test the significance of the ob- 
served differences in responsiveness between 
the individual words, the data for all 90 Ss 
were combined and the differences in reactivity 
to the test words evaluated by means of chi 
square. The resulting values are reported in 
Table 2. All the associated words elicited sig- 
nificantly greater reactivity than did the 
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neutral word “square,” thus demonstrating 
that generalization had taken place. 

Turning to the relative performance of the 
two groups, examination of Fig. 2 reveals that 
the HA group showed consistently greater 
median amplitudes than the LA group. In 
order to obtain a total generalization score, 
the amplitudes of responses to all four test 
words were summed for each S. A chi square 
with respect to this overall score demonstrated 
the HA group to have significantly more gen- 
eralization reactivity than the LA group (chi 
square = 9.3, p < .05 for 3 df). 

Since the PGR amplitude of the HA Ss was 
significantly greater than the LA group during 
conditioning, this fact might be invoked to ex- 
plain the obtained differences in generalization. 
In order to see whether these differences in 
generalization between the HA and LA groups 
could be demonstrated independently of this 
initial difference in conditioning level, two sub- 
groups of 24 Ss cach, one from the HA and 
one from the LA group, were matched on the 
basis of their total amplitude score in condi- 
tioning. A chi-square analysis between these 
subgroups of total generalization scores proved 
to be significant (chi square = 14.96, p < .01 
for 2 df). 

A final comparison was made among the 
incubation groups. These three groups, it will 
be recalled, all received identical training and 
differed only in the amount of time allowed to 
lapse before commencing with the regular test- 
extinction procedure. As previously mentioned, 
it was demonstrated that these groups, referred 
to as Condition I (no rest interval), Condition 
II (10-min. rest interval), and Condition on 
(24-hr. interval) were equally responsive at t S 
end of training; any subsequent differences ar 
thus attributable to the rest-interval variable: 
Fig. 3 summarizes the generalization data 4. 
these three conditions for the HA and w 
groups combined. An analysis of variance 4 
ranks, testing the differences between a, 0 
tions I, II, and III, was significant Pee 3 
p < .01 for 2 and 8 df). Inspection of Tig. 
shows that this significant difference may i 
attributed to the consistently greater a a 
tude of Condition III, Conditions I and II it 
g . e nera 
ing more or less similar at each of the gen’ the 
zation test points. It seemed possible that i 
amount of incubation time might affect me 
HA Ss differently from the LA Ss. The jä 
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lest between incubation conditions was there- 
fore repeated for the LA and HA groups 
separately. For the HA group, the difference 
between Conditions I, II, and III did not reach 
Significance (F = .455), but significance was 
reached for LA Ss (F = 8.5, p < .05 for 2 and 
8 df). The significance of differences between 
conditions reported above could thus be traced 
in large part to differences in the LA group. 


Extinction 


The generalization data reported above rep- 
resented the first of the eight series of test 
trials. The nature of all eight series makes it 
possible to follow the extinction of the CR. 
Only extinction of responses to the CS will be 
described since the test words extinguished 
very rapidly following the first series. Table 3 
presents the median PGR amplitude at each 
extinction series for the CS for all groups and 
subgroups in the study. When the total HA 
and LA groups are considered, it may be seen 
that the HA group tended to maintain greater 
responsivity than the LA group until both 
groups finally demonstrated a median respon- 
sivity of zero. This greater responsivity was 
found to some degree in all three extinction 
conditions, but was most marked in Condition 
I where extinction immediately followed train- 
ing. It might also be noted that contrary to 
the other conditions, the median PGR ampli- 
tude of the Condition I HA Ss did not ap- 
Proach zero by the end of the series, indicating 
that the CR did not completely extinguish. 


TABLE 3 


MEDIAN AMPLITUDE AT Each EXTINCTION 
SERIES FOR INCUBATION AND ANXIETY 


SUBGROUPS 
Extinction Series 
1 2 3 4 5 6 7 8 
Cond. I 
LA 10.0 | 12.0 | 7.5 | 13.0 | 13.5 | 5.0] 3.5] 1.0 
HA 45.0 | 40.0 | 30.0 | 44.0 | 42.5 | 19.5 | 23.0 | 28.0 
Total 32.5 | 22.0 | 13.0 | 28.0 | 23.0 | 6.0 | 13.0 | 10.0 
Cond. II 
LA 19.0 | 11.5 | 23.0 | 12.0 | 7.0 2.5 3.5 0 
HA 50.0 | 21.0] 9.5] 0 9.5] 0 0 0 
Total 21.5 | 13.0 | 19.0 | 8.0 | 8.0) 0 0 0 
Cond. IIT 
LA 37.0 | 17.0 9.0 a) 0 0 0 
HA 29.0 | 25.5 | 11.5 | 8.0] 2.5) 0 0 0 
Total 34.0 | 22.0 9.0} 4.0) 0 0 0 
Total 
Group 
LA 15.0 | 12.5 | 9.0 | 11. 8.0 | 4.5] 2.0] 0 
HA 43.0 | 33.5 | 20.5 | 17.5 | 10.5 1.0) 0 0 


Discussion 


The results of this study confirm Diven’s and 
Lacey’s findings with respect to generalization. 
An inspection of the generalization data also 
seem to indicate that reactivity followed a 
rough gradient, i.e., autonomic responsivity 
varied directly with the degree of association 
of the test stimuli to the CS (“light”). 

Turning to the conditioning data, some 
comment may be made regarding the condi- 
tioning curves themselves. Although it is ap- 
parent that conditioning took place, the 
curves (Fig. 1) are irregular, responsivity being 
less at the end than in the middle of training. 
Such generally declining response curves have 
also been reported by Bitterman and Holtz- 
man (2) and may be at least partly a function 
of adaptation. In the present study, an ex- 
planation in terms of the training procedure 
also suggested itself. It will be recalled that the 
CS was presented alone without the buzzer on 
irregularly spaced trials during training. Thus, 
conditioning test Trial 1 followed one UCS; 
two presentations of the UCS intervened be- 
tween Trial 1 and Trial 2, one UCS between 
Trials 2 and 3, three between Trials 3 and 4, 
and one between Trials 4 and 5. Trials 2 and 
4, consequently, had two and three UCS pres- 
entations immediately preceding them, 
whereas Trials 3 and 5 had only one. Inspec- 
tion of Fig. 1 reveals that the irregularity is 
contributed to by a decline in responsivity on 
Trials 3 and 5. It is possible that the decline 
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ee 
of responsivity on these trials was at least 
partly related to their having fewer interven- 
ing presentations of the UCS; that is, presen- 
tation of the CS alone might have acted as an 
extinction trial, the effects of extinction still 
being noticeable when only one presentation of 
the UCS intervened between two test trials. 
With respect to the anxiety variable, previ- 
ous research using the MAS has shown that 
HA Ss exhibit both a higher level of perform- 
ance in conditioning and extinction and higher 
gradients of SG (2, 18, 13). To the extent that 
SG and MG are affected by the same variables, 
HA Ss were expected to show more MG than 
LA Ss. The expected conditioning and extinc- 
tion differences were found, and the expecta- 
tion with respect to MG was also confirmed. It 
is particularly of interest, in view of the groups 
differences in conditioning, that generalization 
differences were sustained between HA and LA 
subgroups equated for conditioning perform- 
ance level. This supports the Dollard and 
Miller treatment of MG and SG as subsidi- 
aries of the class of phenomena called generali- 
zation. 

The extinction data of the group as a whole 
are somewhat at variance with those of other 
investigators (2) who reported consistently 
greater PGR responsivity during extinction for 
HA Ss than LA Ss. In the present case, this 
difference held up only through the fifth ex- 
tinction series, after which it diminished 
sharply. However, it is important to note that 
Group I was the only group receiving im- 
mediate extinction and hence is the only one 
directly comparable to those of the earlier 
studies, An examination of Fig. 1 reveals that 
under Condition I, HA Ss showed greater 
responsivity throughout and also more resist- 
ance to extinction than the LA Ss. This result, 
then, supports the earlier finding. The data 
also suggest that an unsuspected effect of the 
rest interval was to speed extinction for the 
HA group and thus diminish the differences 
between the two anxiety conditions. 

The incubation of anxiety has been accepted 
as a fairly meaningful and reliable clinical 
phenomenon. If an individual learns an anxiety 
reaction with respect to a particular stimulus 
situation and after a long interval returns, as it 
were, to the scene of the provocation, the anx- 
iety is thought of as becoming more and more 
enhanced with the passage of time. As indi- 
cated, there is some experimental evidence to 
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support the assertion of such an effect. How- 
ever, the findings of this study are not com- 
pletely in line with this contention. In keeping 
with others’ results, a 24-hr. interval between 
training and extinction operated to increase 
galvanic responsivity. This effect was short- 
lived, however, lasting only for the first extinc- 
tion series. The 10-min. interval did not result 
in any PGR increase. This suggests only that 
recovery from adaptation may be involved and 
that a 10-min. interval is an insufficient amount 
of time for the recovery to show its effect. 

MG has been demonstrated to be useful in 
the experimental study of learning and think- 
ing (16). The main finding of this study, the 
demonstration of a gradient of MG and its re- 
lationship to manifest anxiety, indicates that 
this may also be a useful concept in the study 
of pathological disturbances of thinking and 
learning. 


SuMMARY 


The present study was designed to investi- 
gate the phenomena of mediated generalization 
and the incubation effect. An investigation was 
also made of the relationship of mediated 
generalization to level of manifest anxiety as 
measured by the Taylor Manifest Anxiety 
Scale (MAS). 

Two groups of 45 Ss each were chosen on 
the basis of extreme scores on the MAS. A con- 
ditioned PGR was first established by repeated 
pairings of a critical word (CS) with a raucous 
buzzer. Tests of mediated generalization were 
made following training, using as generalization 
stimuli, words to which the CS was associated 
as determined by free associations of a stand- 
ardization group. In order to investigate the 
incubation effect the Ss were divided follow- 
ing training into three subgroups ee, 
the length of the rest interval between conc E 
sion of training and the institution of extinc 
tion trials (immediate extinction, 10-m1- 
interval, and 24-hr. interval). 

The results indicated heightened tp 
sivity towards words associated with the : K 
thus indicating that mediated generalizat i 
had taken place. It was also found that ha 
treme MAS scores were directly related to E 
magnitude of mediated generalization a 
sivity. In agreement with reports of ot pi 
investigators, level of manifest anxiety ‘ad 
found to be directly related to condition”? 


on- 
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responsivity as well as to resistance to ex- 
tinction. 

_ Increased PGR reactivity was found follow- 
ing the 24-hr. pre-extinction interval (i.e., an 
incubation effect), but the influence of this 
effect was short-lived being confined to the 
early part of extinction. The 10-min. delay 
Prior to extinction did not result in galvanic 
activity which was significantly greater than 
that found with the immediate extinction 
condition, 
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A SCALE FOR MEASURING ATTITUDES OF SOCIAL 
RESPONSIBILITY IN CHILDREN! 


DALE B. HARRIS 
Institute of Child Welfare, University of Minnesota 


an attempt to evaluate an attitude of 

social responsibility in children. One of 
the scales used, adapted from the Gough 
responsibility scale (2), and identified as the 
“How I Feel About Citizenship” scale has 
shown virtually no increment with increasing 
age from 9 to 18 years. This fact, rather unique 
in the child psychology literature, together 
with the fact that a number of the items had no 
theoretically demonstrable relationship to the 
definition of social responsibility adopted for 
this study, suggested that the scale be revised 
and revalidated. For the new instrument, we 
retained items from the old scale and devised 
additional items that could be related to our 
theory of responsibility as a composite of 
attitude elements reflecting behavior clas- 
sifiable as reliable, accountable, loyal, or doing 
an effective job. A scale of 89 items was finally 
prepared, each of which could be justified as 
representing one or more of the above men- 
tioned attitudes. Each item was answered as 
“agree” or “disagree.” 


Pes publications (3, 4) have described 


DEVELOPMENT OF THE SCALE 


To provide a criterion, a pupil nomination 
form of a modified Guess Who type was 
administered to children in 13 seventh-and 
eighth-grade classes. The form asked each 
pupil to nominate the boys and girls who were 
“best fits” in his class for each of the four 
aspects of responsibility mentioned above. 

The following untitled four descriptions, 
designed to illustrate these four features of 
responsibility, were submitted to the children: 

1. This is a person upon whom you can 
always count; you can depend on him. When 
he promises that he will do something, you 


'This investigation was supported in part by a 
research grant (M-690) from the National Institute of 
Mental Health of the National Institutes of Health, 
Public Health Service, John E. Anderson and Dale B. 
Harris, Principal Investigators, and in part from funds 
supplied by the Institute of Child Welfare, University 
of Minnesota. The writer wishes to acknowledge the 
assistance of Dr. John E. Anderson and Dr. Harry 


Beilin in evaluating the theoretical relevance of items 
for the scale. 


can always count on his doing it. You can 
count on his word and trust him. 
2. This person is a square shooter. He 
doesn’t try to take advantage of or cheat 
others. When he has done something wrong, 
be will own up to it rather than try to blame 
some one else. 
3. This person thinks for the good of others 
rather than always for his own gain. He is 
loyal to the group. 
4. This person is one who gets things done. 
On a class project, on a committee or on a 
work job he gets right to work and can be 
counted upon to do well and promptly. 
Below each of the descriptions, spaces per- 
mitted writing in three boys’ names and three 
girls’ names, members of the class who were 
considered “best fits” for each description. The 
children were not asked to write their nomina- 
tions in order of merit. Children were instructed 
that any one child might be nominated for 
more than one characteristic. Independently of 
the children, the room teachers also filled in a 
nomination form. The 89-item scale of at- 
titudes, now called the “Social Attitudes 
Scale,” was administered to the same school 
classes a day or so following the “pupil nomina- 
tion” form. à 

The children, who had spent all their school 
time with their principal teachers for the 
previous six months, were as well acquainted 
with each other and with their teachers as is 
generally found in public schools. They repre- 
sented the socioeconomic distribution of the 
city. Two hundred and twenty-seven boys and 
228 girls completed all of the schedules. 
Fortunately there were few absentees or ra 
complete records, and none among the crucia 
cases selected by the nomination form. | 

Although children were not required tO 
nominate three of each sex, most of them a 
plied six names for each characteristic. A tally 
sheet was prepared for each of the four ee 
acteristics within each class group. The ee 
ber of times a given child was mentioned 10 
one characteristic constituted his score on oo 
trait. Thus, the score did not distingus 5 
whether a child’s name appeared in first a 
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third place on a particular blank. Only a very 
few children received no mentions at all in 
their groups and, as is usually the case with 
Guess Who devices, five or six children in a 
group received the majority of the votes cast. 

Rank-order correlations among the several 
traits for six class groups, one from each of 
six schools, show that the same names tended 
to appear high on the list for each of the four 
traits. For particular pairs of traits the medians 
of coefficients calculated from six school 
Samples varied from +.78 to +.82°, showing 
the order of relationships among the four traits 
to be much the same regardless of the pair of 
traits selected. Moreover, there was relatively 
little variation among the several school classes 
In the size of obtained correlations; values 
ranged typically from +.70 to +.89. The 
intercorrelations of scores based on one trait 
with scores obtained by summing scores for 
the other three characteristics ranged from 
+.78 to +.95 in various classes of 33 to 37 
children, Thus, no one characteristic appeared 
to be more closely related than another to the 
sum of scores on the remaining three aspects 
of responsibility. A total score obtained by 
summing the four trait scores appeared 
justified by this marked commonality among 
the aspects of responsibility. 

To provide a criterion for selecting the 
attitude items associated with responsibility, 
25 boys and 25 girls scoring highest on the sum 
of the four characteristics were selected from 
the total population. In every case children 
were selected who were also named for some 
component of responsibility on the teacher’s 
ballot. In only three instances it was not pos- 
sible to include the highest scoring child by 
the peer ballot when the teacher’s judgment 
was added. These 50 responsible children were 
selected, two to six from each class, so as to 
include about a 10 per cent sample of each 
class enrollment. Boys and girls were selected 
in proportion to their distribution in the class. 
In the largest class groups an extra boy or girl 
was taken, since somewhat more than a 10 

2 Tables giving these data in full have been deposited 
with the American Documentation Institute. Order 
Document No. 5323 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress, 
Washington 25, D.C., remitting in advance $1.25 for 
35 mm. microfilm or $1.25 for 6 by 8 inch photocopies. 
Make checks payable to Chief, Photoduplication 
Service, Library of Congress. 

3 Ibid. 
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per cent sample was needed to produce 50 
cases. 

From each class an equal number of boys and 
girls were selected from the unmentioned or 
lowest scoring children, to constitute a 
criterion sample of “less responsible” children.4 


Discussion 


The attitude tests for these criterion groups 
were segregated and analyzed. Proportions of 
each group subscribing to each item were read 
onto Lawshe’s nomograph (6) and the dis- 
criminatory power of the item recorded. Fifty 
items satisfied a Lawshe value of +0.4 or 
better and have been retained. This represents 
approximately the 10 per cent level of signifi- 
cance. Of the 50 items, 21 appeared in the 
“How I Feel About Citizenship” scale used in 
the previous studies, and 29 were new items. 
Twenty-two of the items in the previous blank 
were not used in this schedule, either because 
they did not in our opinion have construct 
validity or did not hold up in the revalidation 
procedure described in this paper. This new 
plank, entitled “Social Attitudes Scale,” was 
used in the 1954 Nobles County studies.® 
Retest reliabilities after an interval of four 
months ranged from +.60 to +.70 for various 
groups of eighth-and tenth-grade children. The 
items are reproduced in Table 1. 

In another study, Rosemary Conzemius (1) 
used the “Social Attitudes Scale” with all 10- 
year-olds and high school seniors enrolled in 
the rural schools of another Minnesota county. 
She divided the children into those who be- 
longed to 4-H Clubs, and those who did not. 
Table 2 compares data from her study with our 
results. Her non-4-H members tend to be 
slightly above our Nobles County rural youth, 
and her 4-H Club members appreciably exceed 
the Nobles County youth in comparable age 
groups. Conzemius found that 4-H Club 
members significantly exceeded the nonmem- 


4 This method does not select irresponsible children 
by nomination for specifically described characteristics. 
Direct peer nominations for socially unacceptable 
behaviors sometimes have unfortunate social and 
public relations consequences. It is assumed that most 
such children would be included in the low count or 
unmentioned children when the positive nomination 
is requested. 

5 A county-wide survey of adjustment and mental 
health in children was conducted through funds sup- 
plied by the National Institute of Mental Health and 
the Institute of Child Welfare. 
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TABLE 1 


Trems RETAINED IN THE “SOCIAL ATTITUDES SCALE” 


1. It is always very important to finish anything one has 
started. 

2. At school, it is easy to find things to do when the teacher 
doesn’t give us enough work. 

3. Police cars should be especially marked so that you can 
always see them coming. 

4, It is no use worrying about current events or public affairs; 
I cannot do anything about them anyhow. 

5. We ought to worry about our own country and let the rest 
of the world take care of itself. 

86, In school my behavior gets me into trouble. 

27, I am hardly ever on time for meals. 
4g. I have been in trouble with the law or police. 

9. When a person does not tell all his income in order to get 
out of paying some of his taxes, it is just as bad as steal- 
ing money from the government. 

10. A person who does not vote when he can, is not a 
good citizen. 

311, I hardly ever get my school work done on time. 

812. I have played hookey from school. 

13. Every citizen should take the time to find out about current 
events if it means giving up some spare time. 

14, In school I am sometimes sent to the principal for being 
bad. 

15. Maybe some minority groups (Negroes, Indians, Mexicans, 
Jews, etc.) do get bad treatment, but it’s no business of 
mine. 

16. We ought to let Europe get out of its own mess. 

17. People criticize me for wasting time. 

“18. When I work on a committee, I usually let other people do 
most of the planning. 

*19. I am often late for school. 

20. If it is worth starting at all, it is worth finishing. 

^21. I am the kind of person that people can count on. 

222, In school, I am one of those who can go on working even 
though the teacher is out of the room. 

23. People can count on me to get things done, without check- 
ing on me. 

^24. I am frequently chosen as a room helper or to run errands. 

®25. I do my chores the very best I know how. 

26. I have been elected leader or president of my class. 

27. Nothing is more important than to be honest with other 
people. 


a28. 
29. 


30. 


31. 


32. 
33. 


34. 


35. 


36. 


a37. 


38. 


39. 


40. 
4l. 


42 


43. 


. It doesn’t really matter whether parents at 


My teacher often complains because I don’t finish my work. 

When you can’t do a job, it is no use to try to find someone 
else to do it. 

It is more important to get the job done than worry about 
hurting other people's feelings. 

Why bother to vote when you can do so little with just your 
one vote. 

“Never give a sucker an even break,” 

Letting your friends down is not so bad because you can’t 
do good all the time for everybody. 

Our country would be a lot better off if we didn’t have elec- 
tions and people didn’t have to vote. 

It’s a good thing the Atlantic Ocean separates us from Eu- 
rope because then we don’t have to worry about them. 

It's more important to work for the good of the team than 
to work for your own good. 

I would never let a friend down when he expects something 
of me. 

People would be a lot better off if they could live far away 
from other people and never have to do anything for them. 

Every person should give some of his time for the good of 
his town or city. 

If everyone pitches in to do a job, it can always get done, 

It is a good rule to do something for your neighbor if he does 
something for you. 

which are important should come before things 


Doing things 
you enjoy doing. 

When a person doesn’t like something he is supposed to do 
he will try to get someone else to do it if he is smart. 


. Cheating on examinations is not so bad as long as nobody 


ever knows 


. People have a real duty to take care of their parents when 


they are old, even if it costs a lot. 


. I usually work things out for myself rather than get some 


one to show me how. 


. I usually volunteer for special projects at school. 
. Children often get punished when they don’t deserve it. 
"49, 


When given a task I stick to it even if things I like to do 


better come along. 
tend Parent- 


Teacher meetings regularly. 


® Designates “personal reference” item. 


TABLE 2 


RESPONSIBILITY TEST Scores FOR SEVERAL 
RURAL SAMPLES 


Boys Girls 
Sample 
N|X|SD|N| X |50 
10-year-olds, Nobles 225/34.4/6.48/204/36.5/5.54 
County 
Conzemius’ 10-year-olds 
Non-4-H members 85/35 .8)4.51| 92/38.5|/5.48 
4-H members 54/39.3)4.67| 50/40.4|5.34 
17-year-olds, Nobles 119}37.8)5.96)112/41 .6|4.43 
County 
Conzemius’ high school 
seniors 
Non-4-H members 27|38.9,4.32) 31/42.7/4.11 
4-H members 34.41.1.4.67) 2943.63.19 


bers in every comparison, though varan 


ces 


were not homogeneous in every instanc? 
Whether this difference represents selection 0" 
the special effect of 4-H experience, she does n° 


choose to say. Conzemius’ experienc 
sistent with ours in finding greater sex 
ences than in our previous study, an 
what greater relationship of score to age 
obtained in our previous work (3). 

The “Social Attitudes Scale” yields a num 


e is con- 
differ- 
d a some 
than 


ber 


of correlations with other measures which a7 
of interest. Table 3 reports a number of thes? 
and shows that correlations of score with socio” 
economic status are negligible, but with meas 
ures of adjustment they are moderately 18" 
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The correlation of the new scale with the old 
scale on two different samples of children who 
took the two schedules four years apart gives 
values of +.14 and +.32. 

Whereas the “How I Feel About Citizen- 

ship” measure (1950) showed no age trend, a 
noticeable age relationship appears in the 
“Social Attitudes” test data (1954). From the 
youngest to the oldest age groups, the shift in 
mean score is approxmately one standard 
deviation of the total distribution. Further- 
more, the relationship to age is more pro- 
nounced in girls than in boys. The differences 
between the means of the sexes is about one- 
fourth a standard deviation of the combined 
distributions in the younger ages and more 
than one-half a standard deviation in the 
oldest age groups. In the sense of this test, girls 
are perhaps better “socialized” than boys. 
; It will be recalled that less than half the 
items of the 1954 schedule appeared in the 1950 
measure. It may be that the 1950 measure, 
based strictly on empirically selected items, 
contained a large random component, and it is 
true that the reliability coefficients were not 
large (of the order of +.64 to +-.88 for a two- 
week interval). The 1954 measure, designed 
for construct as well as concurrent validity, 
seems to have included more age-related items. 
The reliability coefficients were comparable, 
being of the order of +.60 to +-.70 for a four- 
month retest interval. Both measures, however, 
showed correlations with adjustment measures 
of the order of +.30 to +.40 and better, and 
the “Social Attitudes Scale” consistently 
yields the larger values. Furthermore, there is 
a suggestion that the Social Attitudes test 
tends also in the adolescent period towards an 
increased association with other measures, 
especially measures of adjustment. There may 
be more organization or responsibility as a 
focal characteristic as the child matures in his 
experience. 

In 1950, 26 of the 43 items (60%) were 
personal reference items, couched in the first 

erson and having to do with the child’s own 
conduct. In 1954, only 18 of the 50 items (36%) 
were of this character. The tests for a stratified 
sample® in age groups 10, 12, 14, and 16 were 

6 For other research purposes a “standard sample” 
of 50 boys and 50 girls had been selected at each year 


of age from our larger supply of children (running 
about 300 in each age group). This sample was stratified 


TABLE 3 
CORRELATIONS OF RESPONSIBILITY TEST SCORES 
WITH CERTAIN OTHER VARIABLES: Boys 
AND GIRLS COMBINED 


Ages | Ages | Ages | Ages 
Variable 9 &|11 &|13 &|15 & 
10 | 12 | 14 | 16 


Father's education 5 | .17 | :244) 15 
Father's occupation (1954) .06 | .07 | .11 | .06 
“How I Feel About Things”* 
Adjustment score .38 | -44 | .42 | 49 
Adjustment-maturity score .34 | .45 | .46 | .47 
Sentence Completion Test | .03 | -16 2231) 27 
score | 


a A schedule in the Nobles County study of attitude and in- 
terest items selected empirically to discriminate between well- 
adjusted and poorly adjusted children. 

b This is also a Nobles County study instrument. It is scored 
according to the number of items showing positi affect in the 
completion supplied. This score has a low positive correlation 
with a criterion of adjustment. 


TABLE 4 
MEANS AND STANDARD DEVIATIONS FOR TOTAL SCORE 
ON THE SOCIAL ATTITUDES MEASURE AND 
FOR PERSONAL REFERENCE ITEMS AND 
NONPERSONAL REFERENCE ITEMS 


Total Score Remon Ng ia 

Age| Sex | N | Items Items 
Mean | SD | Mean | SD | Mean | SD 
w| B | 93 | 361 | 5.02} 11.98 | 2.60 | 22.63 | 4.30 
cG | 91 | 37.08 | 4.75 | 13.63 | 1.82 | 23.45 | 3.90 
| B | 95 | 35-9 | 5-14 | 12.49 | 2.11 24.44 | 4.06 
Gc | 98 | 39.42| 3.99] 13.86 | 2.23 | 25.57 | 4.44 
a | B | 92 | 36-48] 6.77 | 11.36 | 2.73 | 25.12 | 5.03 
G | 93 | 40.53 | 4.48 | 13.39 | 2.07 | 27.14 | 3.39 
je | B | 9e | 3778 | 6-56 12.01 | 2.59 | 25.73 | 4.85 
e|] 41.57 | 3.97 | 13.43 | 2.01 | 28.14 | 3.01 


recorded to provide separate scores on the 18 
personal reference items (see marked items in 
Table 1) and on the 32 nonpersonal reference 
items (unmarked items in Table 1.) Table 4 
makes it clear that the age increment in score 
on the Social Attitudes measure as a whole is 
due to age change in the nonpersonal reference 
items. 

An analysis of another aspect of the Nobles 
County work? has shown that adjustment 
items of the psychoneurotic type are almost 


according to occupation of the father, to represent 
the U.S. 1950 occupational distribution. 
7 As yet unpublished. 
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entirely of this personal reference type and are 
also free from an age relationship. Perhaps 
attitudes toward the community, society, and 
human relationships in general are modified by 
education, experience and maturity, while self- 
attitudes and personal habits are less amenable 
to change. Or, possibly, children progressively 
acquire more socially acceptable verbal state- 
ments about society and social relations as 
they move through school but fail to modify 
expressions about themselves, their self- 
feelings, and their own habits. Few studies of 
the “self image” have drawn this contrast, 
and this should prove a fruitful hypothesis for 
further work. 

Correlations between the personal and non- 
personal subscores are modest, ranging from 
+.29 to +.53, and being characteristically 
around +.40.* There is no appreciable tendency 
for correlational values to change systemat- 
ically with age, so that no argument can be 
offered concerning “integration” or ‘‘differen- 
tiation” of attitudes measured by these scales, 
over the age range represented in this study. 
Some information on the reliability of these 
subscales may be of interest. For a sample of 
50 twelve-year-old girls the estimate of relia- 
bility obtained by Hoyt’s analysis of variance 
method (5) was +.73. For the subscale of 18 
personal reference items, the estimate of reli- 
ability was +-.47, and for the 32 nonpersonal 
reference items the value was +.63. 


SUMMARY 


A scale of social attitudes was designed to 
discriminate children who have, with their 
peers, a reputation for responsibility as con- 


8 American Documentation Institute Document 


No. 5323, op. cit. 


Harris 


trasted with children who have little reputa- 
tion for responsibility. The resulting scale 
yields a measure which is substantially corre- 
lated with other measures of personal and 
social adjustment. Unlike an earlier scale de- 
signed for a similar purpose, the present scale 
shows a positive trend in mean score with age. 
Analysis revealed that this age trend results 
from a larger proportion of items in the present 
scale which refers to social and societal obliga- 
tions (or at least do not contain a highly per- 
sonal reference as indicated by the first per- 
sonal singular pronoun). Items which have a 
highly personal reference, referring to self- 
feelings, attitudes, and reports of personal be- 
havior, do not show an age trend. The two 
sets of items are modestly intercorrelated, on 
the order of +-.40. 
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A CROSS-CULTURAL SURVEY OF SOME SEX DIFFERENCES 
IN SOCIALIZATION! 


HERBERT BARRY III, MARGARET K. BACON, anp IRVIN L. CHILD 
Yale University 


N OUR society, certain differences may be 
observed between the typical personality 
A characteristics of the two sexes. These sex 
differences in personality are generally believed 
to result in part from differences in the way 
boys and girls are reared. To the extent that 
personality differences between the sexes are 
thus of cultural rather than biological origin, 
they seem potentially susceptible to change. 
But how readily susceptible to change? In the 
differential rearing of the sexes does our society 
make an arbitrary imposition on an infinitely 
plastic biological base, or is this cultural im- 
Position found uniformly in all societies as an 
adjustment to the real biological differences 
between the sexes? This paper reports one 
attempt to deal with this problem. 


DATA AND PROCEDURES 


The data used were enthnographic reports, 
available in the anthropological literature, 
about socialization practices of various cul- 
tures. One hundred and ten cultures, mostly 
nonliterate, were studied. They were selected 
primarily in terms of the existence of adequate 
ethnographic reports of socialization practices 
and secondarily so as to obtain a wide and 
reasonably balanced geographical distribution. 
Various aspects of socialization of infants and 
children were rated on a 7-point scale by two 
judges (Mrs. Bacon and Mr. Barry). Where 
the ethnographic reports permitted, separate 
ratings were made for the socialization of boys 
and girls. Each rating was indicated as either 
confident or doubtful; with still greater un- 
certainty, or with complete lack of evidence, 
the particular rating was of course not made at 
all. We shall restrict the report of sex difference 
ratings to cases in which both judges made a 
confident rating. Also omitted is the one in- 
stance where the two judges reported a sex 
difference in opposite directions, as it demon- 


1 This research is part of a project for which financial 
support was provided by the Social Science Research 
Council and the Ford Foundation. We are greatly in- 
debted to G. P. Murdock for supplying us with certain 
data, as indicated below, and to him and Thomas W. 
Maretzki for suggestions that have been used in this 
paper. 


strates only unreliability of judgment. The 
number of cultures that meet these criteria is 
much smaller than the total of 110; for the 
several variables to be considered, the number 
varies from 31 to 84. 

The aspects of socialization on which ratings 
were made included: 


1. Several criteria of attention and indulgence 
toward infants. 

2. Strength of socialization from age 4 or 5 years 
until shortly before puberty, with respect to five 
systems of behavior; strength of socialization was 
defined as the combination of positive pressure (rewards 
for the behavior) plus negative pressure (punishments 
for lack of the behavior). The variables were: 

(a) Responsibility or dutifulness training. (The data 
were such that training in the performance of chores in 
the productive or domestic economy was necessarily the 
principal source of information here; however, training 
in the performance of other duties was also taken into 
account when information was available.) 

(b) Nurturance training, i.e., training the child to 
be nurturant or helpful toward younger siblings and 
other dependent people. 

(c) Obedience training. 

(d) Self-reliance training. 

(e) Achievement training, i.e., training the child to 
orient his behavior toward standards of excellence in 
performance, and to seek to achieve as excellent a 
performance as possible. 


Where the term “no sex difference” is used 
here, it may mean any of three things: (a) the 
judge found separate evidence about the train- 
ing of boys and girls on this particular variable, 
and judged it to be identical; (b) the judge 
found a difference between the training of 
boys and girls, but not great enough for the 
sexes to be rated a whole point apart on a 7- 
point scale; (c) the judge found evidence only 
about the training of “children” on this 
variable, the ethnographer not reporting sep- 
arately about boys and girls. 


Sex DIFFERENCES IN SOCIALIZATION 


On the various aspects of attention and 
indulgence toward infants, the judges almost 
always agreed in finding no sex difference. Out 
of 96 cultures for which the ratings included the 
infancy period, 88 (92%) were rated with no 
sex difference by either judge for any of those 
variables. This result is consistent with the 
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TABLE 1 
RATINGS OF CULTURES FOR SEX DIFFERENCES ON FIVE VARIABLES OF CHILDHOOD SOCIALIZATION PRESSURE 


Both Judges Agree in 
Rating the Variable 


Percentage of Cultures with 


One Judge Rates No 
Evidence of Sex Difference in 


Difference, One Rates the 


Variable Numbenot Higher in Variable Higher in Direction of 
Girls Boys Girls Boys Girls Boys Neither 
Nurturance 33 17 0 10 0 82% 0% 18% 
Obedience 69 6 0 18 2) 35% 3% 62% 
Responsibility 84 25 2 26 7 61% 11% 28% 
Achievement 31 0 17 1 10 3% 87% 10% 
Self-reliance 82 0 64 0 6 0% 85% 15% 


point sometimes made by anthropologists 
that “baby” generally is a single status un- 
differentiated by sex, even though “boy” and 
“girl” are distinct statuses. 

On the variables of childhood socialization, 
on the other hand, a rating of no sex difference 
by both judges was much less common. This 
finding of no sex difference varied in frequency 
from 10% of the cultures for the achievement 
variable up to 62% of the cultures for the 
obedience variable, as shown in the last column 
of Table 1. Where a sex difference is reported, 
by either one or both judges, the difference 
tends strongly to be in a particular direction, 
as shown in the earlier columns of the same 
table. Pressure toward nurturance, obedience, 
and responsibility is most often stronger for 
girls, whereas pressure toward achievement 
and self-reliance is most often stronger for boys. 

For nurturance and for self-reliance, all the 
sex differences are in the same direction. For 
achievement there is only one exception to the 
usual direction of difference, and for obedience 
only two; but for responsibility there are nine, 
What do these exceptions mean? We have re- 
examined all these cases. In most of them, only 
one judge had rated the sexes as differently 
treated (sometimes one judge, sometimes the 
other), and in the majority of these cases both 
judges were now inclined to agree that there 
was no convincing evidence of a real difference, 
There were exceptions, however, especially in 
cases where a more formal or systematic train- 
ing of boys seemed to imply greater pressure on 
them toward responsibility. The most con- 
vincing cases were the Masai and Swazi, where 
both judges had originally agreed in rating 
responsibility pressures greater in boys than in 
girls. In comparing the five aspects of socializa- 
tion we may conclude that responsibility shows 
by far the strongest evidence of real variation 
in the direction of sex difference, and obedience 


much the most frequently shows evidence of 
no sex difference at all. 

In subsequent discussion we shall be assum- 
ing that the obtained sex differences in the 
socialization ratings reflect true sex differences 
in the cultural practices. We should consider 
here two other possible sources of these rated 
differences. 

1. The ethnographers could have been biased 
in favor of seeing the same pattern of sex dif- 
ferences as in our culture. However, most 
anthropologists readily perceive and eagerly 
report novel and startling cultural features, so 
we may expect them to have reported unusual 
sex differences where they existed. The dis- 
tinction between matrilineal and patrilineal, 
and between matrilocal and patrilocal cultures, 
given prominence in many ethnographic re- 
ports, shows an awareness of possible variations 
in the significance of sex differences from 
culture to culture. 

2. The two judges could have expected to 
find in other cultures the sex roles which are 
familiar in our culture and inferred them from 
the material on the cultures, However, we have 
reported only confident ratings, and such a 
bias seems less likely here than for doubtful 
ratings. It might be argued, moreover, that 
bias has more Opportunity in the cases ambig- 
uous enough so that only one judge reported a 
Sex difference, and less opportunity in the cases 
where the evidence is so clear that both judges 
agree. Yet in general, as may be seen in Table 
1, the deviant cases are somewhat more fre- 
quent among the cultures where only one judge 
reported a sex difference, 

The observed differences in the socialization 
of boys and girls are consistent with certam 
universal tendencies in the differentiation ° 
adult sex role. In the economic sphere, men are 
more frequently allotted tasks that involve 
leaving home and engaging in activities wher? 


Sex DIFFERENCES IN SOCIALIZATION 


a high level of skill yields important returns; 
hunting is a prime example. Emphasis on 
training in self-reliance and achievement for 
boys would function as preparation for such an 
economic role. Women, on the other hand, are 
more frequently allotted tasks at or near home 
that minister most immediately to the needs of 
others (such as cooking and water carrying); 
these activities have a nurturant character, 
and in their pursuit a responsible carrying out 
of established routines is likely to be more 
Important than the development of an es- 
pecially high order of skill. Thus training in 
nurturance, responsibility, and, less clearly, 
obedience, may contribute to preparation for 
this economic role. These consistencies with 
adult role go beyond the economic sphere, of 
Course. Participation in warfare, as a male 
Prerogative, calls for self-reliance and a high 
order of skill where survival or death is the 
immediate issue. The childbearing which is 
biologically assigned to women, and the child 
care which is socially assigned primarily to 
them, lead to nurturant behavior and often 
call for a more continuous responsibility than 
do the tasks carried out by men. Most of these 
distinctions in adult role are not inevitable, 
but the biological differences between the sexes 
strongly predispose the distinction of role, if 
made, to be in a uniform direction.’ 

The relevant biological sex differences are 
conspicuous in adulthood but generally not in 
childhood. If each generation were left entirely 
to its own devices, therefore, without even an 
older generation to copy, sex differences in role 
would presumably be almost absent in child- 
hood and would have to be developed after 
puberty at the expense of considerable re- 
learning on the part of one or both sexes. 
Hence, a pattern of child training which 
foreshadows adult differences can serve the 
useful function of minimizing what Benedict 
termed “discontinuities in cultural condi- 
tioning” (1). 

The differences in socialization between the 
sexes in our society, then, are no arbitrary 
custom of our society, but a very widespread 
adaptation of culture to the biological sub- 
stratum of human life. 


2For data and interpretations supporting various 
arguments of this paragraph, see Mead (2), Murdock 
(3), and Scheinfeld (6). 
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VARIATIONS IN DEGREE OF 
Sex DIFFERENTIATION 


While demonstrating near-universal tend- 
encies in direction of difference between the 
socialization of boys and girls, our data do not 
show perfect uniformity. A study of the varia- 
tions in our data may allow us to see some of 
the conditions which are associated with, and 
perhaps give rise to, a greater or smaller degree 
of this difference. For this purpose, we classified 
cultures as having relatively large or small sex 
difference by two different methods, one more 
inclusive and the other more selective. In both 
methods the ratings were at first considered 
separately for each of the five variables. A sex 
difference rating was made only if both judges 
made a rating on this variable and at least one 
judge’s rating was confident. 

In the more inclusive method the ratings 
were dichotomized, separately for each 
variable, as close as possible to the median 
into those showing a large and those showing a 
small sex difference. Thus, for each society a 
large or a small sex difference was recorded for 
each of the five variables on which a sex dif- 
ference rating was available. A society was 
given an over-all classification of large or small 
sex difference if it had a sex difference rating on 
at least three variables and if a majority of 
these ratings agreed in being large, or agreed in 
being small. This method permitted classifica- 
tion of a large number of cultures, but the 
grounds for classification were capricious in 
many cases, as a difference of only one point in 
the rating of a single variable might change 
the over-all classification of sex difference for a 
culture from large to small. 

In the more selective method, we again 
began by dichotomizing each variable as close 
as possible to the median; but a society was 
now classified as having a large or small sex 
difference on the variable only if it was at least 
one step away from the scores immediately 
adjacent to the median. Thus only the more 
decisive ratings of sex difference were used. A 
culture was classified as having an over-all 
large or small sex difference only if it was given 
a sex difference rating which met this criterion 
on at least two variables, and only if all such 
ratings agreed in being large, or agreed in being 
small. 

We then tested the relation of each of these 
dichotomies to 24 aspects of culture on which 
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TABLE 2 
CULTURE VARIABLES CORRELATED WITH LARGE 
SEx DIFFERENCE IN SOCIALIZATION, 
SEPARATELY FOR Two TYPES 


OF SAMPLE 
More Selec- | More Inclu- 
Variable tive Sample | sive Sample 
$ N $ N 
Large animals are hunted -48* (34)|.28* (72) 
Grain rather than root crops |.82** (20)).62**| (43) 
are grown 
Large or milking animals -65* | (19) .43* | (35) 
rather than small animals 
are kept 
Fishing unimportant or absent |.42* (31)|.19 | (69) 
Nomadic rather than seden- -61**) (34)|.15 |(71) 
tary residence 
Polygyny rather than monog- |.51* (28)| .38**| (64) 
amy 
* p < 05. 
PFD xz D 


Note.—The variables have been so phrased that all correla- 
tions are positive. The phi coefficient is shown, and in parenthe- 
ses, the number of cases on which the comparison was based. 
Significance level was determined by x2, or Fisher’s exact test 
where applicable, using in all cases a two-tailed test. 


Murdock has categorized the customs of most 
of these societies’ and which seemed of possible 
significance for sex differentiation. The aspects 
of culture covered include type of economy, 
residence pattern, marriage and incest rules, 
political integration, and social organization. 
For each aspect of culture, we grouped 
Murdock’s categories to make a dichotomous 
contrast (sometimes omitting certain categories 
as irrelevant to the contrast). In the case of 
some aspects of culture, two or more separate 
contrasts were made (e.g., under form of mar- 
riage we contrasted monogamy with polygyny, 
and also contrasted sororal with nonsororal 
polygyny). For each of 40 comparisons thus 
formed, we prepared a 2 x 2 frequency table to 
determine relation to each of our sex-diff erence 
dichotomies. A significant relation was found 
for six of these 40 aspects of culture with the 
more selective dichotomization of over-all sex 
difference. In four of these comparisons, the 
relation to the more inclusive dichotomization 
was also significant. These relationships are 
all given in Table 2, in the form of phi coef- 
ficients, along with the outcome of testing sig- 
nificance by the use of x? or Fisher’s exact test. 
In trying to interpret these findings, we have 
also considered the nonsignificant correlations 


* These data were supplied to us directly by Profes- 
sor Murdock. 
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with other variables, looking for consistency 
and inconsistency with the general implications 
of the significant findings. We have arrived at 
the following formulation of results: 

1. Large sex difference in socialization is 
associated with an economy that places a high 
premium on the superior strength, and superior 
development of motor skills requiring strength, 
which characterize the male. Four of the cor- 
relations reported in Table 2 clearly point to 
this generalization: the correlations of large 
sex difference with the hunting of large animals, 
with grain rather than root crops, with the 
keeping of large rather than small domestic 
animals, and with nomadic rather than sed- 
entary residence. The correlation with the un- 
importance of fishing may also be consistent 
with this generalization, but the argument is 
not clear.t Other correlations consistent with 
the generalization, though not statistically 
significant, are with large game hunting rather 
than gathering, with the hunting of large game 
rather than small game, and with the general 
importance of all hunting and gathering. 

2. Large sex difference in socialization ap- 
pears to be correlated with customs that make 
for a large family group with high cooperative 
interaction. The only statistically significant 
correlation relevant here is that with polygyny 
rather than monogamy. This generalization is, 
however, supported by several substantial 
correlations that fall only a little short of being 
statistically significant. One of these is a 


‘Looking (with the more inclusive sample) into the 
Possibility that this correlation might result from the 
correlation between fishing and sedentary residence, 2 
ction between these variables was 
found. The Correlation of sex differentiation with 
absence of fishing is found only in nomadic societies, 
kely to involve cooperative activity 
and its absence is likely to mean 
he male for large game hunting ar 
mals (whereas in sedentary societies 
a o fishing do not so uniformly require 
special emphasis on male strength). The correlation 0 
sex differentiation with nomadism is found only 
nonfishing societies; here nomadism is likely to imply 
large game hunting or herding large animals, whereas 
in fishing societies nomadism evidently implies no suc 
special dependence upon male strength. Maximum Së 
differentiation is found in nomadic nonfishing societie? 
(15 with large difference and only 2 with small) a” 
minimum sex differentiation in nomadic fishing societi> 
(2 with large difference and 7 with small difference 
These findings further strengthen the argument for 


ji R e! 
conspicuous influence of the economy upon sex di 
entiation. e 
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correlation with sororal rather than nonsororal 
polygyny; Murdock and Whiting (4) have 
presented indirect evidence that co-wives 
generally show smoothercooperative interaction 
if they are sisters. Correlations are also found 
with the presence of either an extended or a 
polygynous family rather than the nuclear 
family only; with the presence of an extended 
family; and with the extreme contrast between 
maximal extension and no extension of the 
family. The generalization is also to some extent 
supported by small correlations with wide 
extension of incest taboos, if we may presume 
that an incest taboo makes for effective un- 
threatening cooperation within the extended 
family. The only possible exception to this 
generalization, among substantial correlations, 
is a near-significant correlation with an ex- 
tended or polygynous family’s occupying a 
cluster of dwellings rather than a single 
dwelling.® 

In seeking to understand this second 
generalization, we feel that the degree of social 
isolation of the nuclear family may perhaps be 
the crucial underlying variable. To the extent 
that the nuclear family must stand alone, the 
man must be prepared to take the woman’s role 
when she is absent or incapacitated, and vice 
versa. Thus the sex differentiation cannot 
afford to be too great. But to the extent that 
the nuclear family is steadily interdependent 
with other nuclear families, the female role in 
the household economy can be temporarily 
taken over by another woman, or the male role 
by another man, so that sharp differentiation 
of sex role is no handicap. 

The first generalization, which concerns 
the economy, cannot be viewed as dealing with 
material completely independent of the ratings 
of socialization. The training of children in 

5 We think the reverse of this correlation would be 
more consistent with our generalization here. But 
perhaps it may reasonably be argued that the various 
nuclear families composing an extended or polygynous 
family are less likely to develop antagonisms which 
hinder cooperation if they are able to maintain some 
physical separation. On the other hand, this variable 
may be more relevant to the first generalization than 
to the second. Occupation of a cluster of dwellings is 
highly correlated with presence of herding and with 
herding of large rather than small animals, and these 
economic variables in turn are correlated with large 
sex difference in socialization. Occupation of a cluster 
of dwellings is also correlated with polygyny rather 
than monogamy and shows no correlation with sororal 
vs. nonsororal polygyny. 
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their economic role was often an important 
part of the data used in rating socialization 
variables, and would naturally vary according 
to the general economy of the society. We 
would stress, however, that we were by no 
means using the identical data on the two sides 
of our comparison; we were on the one hand 
judging data on the socialization of children 
and on the other hand using Murdock’s 
judgments on the economy of the adult culture. 
In the case of the second generalization, it 
seems to us that there was little opportunity 
for information on family and social structure 
to have influenced the judges in making the 
socialization ratings. 

Both of these generalizations contribute to 
understanding the social background of the 
relatively small difference in socialization of 
boys and girls which we believe characterizes 
our society at the present time. Our mechanized 
economy is perhaps less dependent than any 
previous economy upon the superior average 
strength of the male. The nuclear family in our 
society is often so isolated that husband and 
wife must each be prepared at times to take 
over or help in the household tasks normally 
assigned to the other. It is also significant that 
the conditions favoring low sex differentiation 
appear to be more characteristic of the upper 
segments of our society, in socioeconomic and 
educational status, than of lower segments. 
This observation may be relevant to the 
tendency toward smaller sex differences in 
personality in higher status groups (cf. 
Terman and Miles, 8). 

The increase in our society of conditions 
favoring small sex difference has led some 
people to advocate a virtual elimination of sex 
differences in socialization. This course seems 
likely to be dysfunctional even in our society. 
Parsons, Bales, et al. (5) argue that a differ- 
entiation of role similar to the universal pattern 
of sex difference is an important and perhaps 
inevitable development in any social group, 
such as the nuclear family. If we add to their 
argument the point that biological differences 
between the sexes make most appropriate the 
usual division of those roles between the sexes, 
we have compelling reasons to expect that the 
decrease in differentiation of adult sex role will 
not continue to the vanishing point. In our 
training of children, there may now be less 
differentiation in sex role than characterizes 
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adult life—so little, indeed, as to provide in- 
adequate preparation for adulthood. This 
state of affairs is likely to be especially true of 
formal education, which is more subject to 
conscious influence by an ideology than is in- 
formal socialization at home. With child 
training being more oriented toward the male 
than the female role in adulthood, many of the 
adjustment problems of women in our society 
today may be partly traced to conflicts growing 
out of inadequate childhood preparation for 
their adult role, This argument is nicely sup- 
ported in extreme form by Spiro’s analysis of 
sex roles in an Israeli kibbutz (7). The ideology 
of the founders of the kibbutz included the 
objective of greatly reducing differences in sex 
role. But the economy of the kibbutz is a 
largely nonmechanized one in which the supe- 
rior average strength of men is badly needed in 
many jobs. The result is that, despite the 
ideology and many attempts to implement it, 
women continue to be assigned primarily to 
traditional “women’s work,” and the in- 
compatibility between upbringing or ideology 
and adult role is an important source of conflict 
for women. 

Note on regional distribution. There is marked 
variation among regions of the world in typical 
size of sex difference in socialization. In our 
sample, societies in North America and Africa 
tend to have large sex difference, and societies 
in Oceania to have small sex difference. Less 
confidently, because of the smaller number of 
cases, we can report a tendency toward small 
sex differences in Asia and South America as 
well. Since most of the variables with which we 
find the sex difference to be significantly cor- 
related have a similar regional distribution, 
the question arises whether the correlations 
might better be ascribed to some quite different 
source having to do with large regional simi- 
Jarities, rather than to the functional depend- 
ence we have suggested. As a partial check, we 
have tried to determine whether the correlations 
we report in Table 2 tend also to be found 
strictly within regions. For each of the three 

regions for which we have sizable samples 
(North America, Africa, and Oceania) we have 
separately plotted 2 x 2 tables corresponding 
to each of the 6 relationships reported in Table 
2. (We did this only for the more inclusive 
sample, since for the more selective sample the 
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number of cases within a region would have 
been extremely small.) Out of the 18 correla- 
tions thus determined, 11 are positive and only 
3 are negative (the other 4 being exactly zero). 
This result clearly suggests a general tendency 
for these correlations to hold true within 
regions as well as between regions, and may 
lend further support to our functional inter- 
pretation. 


SUMMARY 


A survey of certain aspects of socialization in 
110 cultures shows that differentiation of the 
sexes is unimportant in infancy, but that in 
childhood there is, as in our society, a wide- 
spread pattern of greater pressure toward 
nurturance, obedience, and responsibility in 
girls, and toward self-reliance and achievement 
striving in boys. There are a few reversals of 
sex difference, and many instances of no de- 
tectable sex difference; these facts tend to 
confirm the cultural rather than directly 
biological nature of the differences, Cultures 
vary in the degree to which these differentia- 
tions are made; correlational analysis suggests 
some of the social conditions influencing these 
variations, and helps in understanding why our 
society has relatively small sex differentiation. 


REFERENCES 


1. Benenicr, Ruri. Continuitics and discontinuities 
x aa conditioning. Psychiatry, 1938, 1» 
1-167. 


- MEAD, MarcareT. Male and female. New York: 
Morrow, 1949, 
3. Murdock, G. P. Comparative data on the division 
of labor by sex. Social Forces, 1937, 15, 551-553. 
4. Murvock, G. P., & Wnuirinc, J. W. M. Cultural 
determination of parental attitudes: The relation- 
ship between the social structure, particularly 
family structure and parental behavior. In M. J- 
E. Senn (Ed.), Problems of infancy and childhood: 
Transactions of the Fourth Conference, Mare 
6-7, 1950. New York: Josiah Macy, Jr. Founda- 
tion, 1951. Pp. 13-34, 
5 Parsons, T., BALES, R. F., et al. Family, socialize 
tion and interaction process. Glencoe, Ill.: Free 
Press, 1955. 
SCHEINFELD, A. Women and men. New York: Hat 
court, Brace, 1944, 
7. Sero, M. E. Kibbutz: 
bridge: Harvard Unive: 
8. Terman, L. M., & 
personality. New 


Venture in Utopia. Cam 
r. Press, 1956. d 
Mues, Carnerine C. Sea 4” 
York: McGraw-Hill, 1936. 


Received February 6, 1957, 


SE SEE S—t—‘itsS 
es 


AVOIDANCE LEARNING OF PERCEPTUAL DEFENSE 
AND VIGILANCE? 
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nosr who argue for a perceptual defense 

mechanism analogous to repression and 

those who believe that the adaptive 
value of sensitization to threat suggests a 
Vigilance mechanism can both find support ina 
wealth of controversial data (3, 7)—as well as 
can those who would deny both views (4, 11, 
12, 13). Much of the confusion has resulted 
from the use of linguistic stimuli whose sig- 
nificance to the subject is a matter of dispute. 
So long as the history of the subjects’ com- 
merce with the stimulus materials remains un- 
certain, convenient assumptions permit any of 
the several views to be salvaged in the face of 
most data. The chief object of the present study 
is to produce experimentally the learning of 
perceptual defense and vigilance, and in so 
doing, to relate the mechanisms to antecedent 
experimental procedures. 

Knowing that a stimulus is anxiety arousing 
does not in itself permit us to predict the 
perceptual reaction to that stimulus. Lazarus, 
Eriksen, and Fonda (6) have demonstrated 
both perceptual defense and vigilance reactions 
to the same hostile or sexual, and presumably 
threatening, material; which reaction occurs is 
predicted from diagnostic categories. En- 
countering stimuli with sexual or hostile 
meaning is no novelty to the adult subject, who 
has had ample opportunity for learning to 
handle any anxiety they engender. We might 
guess that one kind of subject has learned to 
defend against threatening stimuli because 
with his particular experiences and personal 
economy that reaction has somehow been to his 
advantage. Another has learned perceptual 
vigilance because that reaction served him well. 
The specific learning mechanism involved has 
not been made clear. Hence, a second purpose 
of this study is to offer an account in terms of 
behavior theory of how perceptual defense and 
vigilance may be learned and to test two 
hypotheses derived from that account. 


1'This article is based on a doctoral dissertation 
submitted to the Department of Psychology, Uni- 
versity of Michigan. I wish to express my gratitude to 
Dr, E. L. Walker for his generous advice and guidance 
as chairman of my doctoral committee. 


This analysis extends to perception a two- 
stage conception of avoidance learning that is 
essentially in agreement with the views of 
Mowrer (9) and of Solomon (16). Once an 
anxiety response has been classically condi- 
tioned to a previously neutral stimulus, any 
number of perceptual reactions are theoret- 
ically possible—vigilance and defense, as well 
as many qualitative distortions. The con- 
venience of certain psychophysical methods 
has led to great experimental interest in two 
such reactions along the single dimension of 
recognizability. We may think of these po- 
tential perceptual reactions to a threatening 
stimulus as ordered in a hierarchy of proba- 
bility of occurrence. There is some evidence 
that perceptual vigilance holds the dominant 
position in an initial hierarchy of perceptual 
reactions to threat. Pustell (14) produced 
perceptual vigilance experimentally by clas- 
sically associating an electric shock with a num- 
ber of neutral geometric figures. Lazarus and 
McCleary (5) classically conditioned GSRs to 
nonsense syllables, obtaining their threshold 
measures under threat of further punishment, 
but failed to find a significant difference be- 
tween thresholds for their shock syllables and a 
number of nonshock syllables. Could a re- 
luctance to name the shock syllable have 
cancelled out a primitive vigilance reaction? 
Lysak (8) replicated their finding, then in 
another experiment removed the electrodes 
before obtaining threshold measures and found 
thresholds for his shock syllables to be lower 
than those for nonshock syllables. These 
findings suggest that perceptual vigilance is 
dominant in an initial hierarchy of perceptual 
reactions to a threatening stimulus, although 
response suppression may obscure the effect 
when there is a continuing imminent possibility 
of punishment. The superior adaptive value of 
such a mechanism is obvious. 

Whatever perceptual reaction is initially 
dominant, it should be possible to realign this 
hierarchy by introducing selective reinforce- 
ment. The perceptual reaction that is in- 
strumental to the avoidance of punishment and 
reduction of the conditioned anxiety should be 
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strengthened. In the course of ordinary ex- 
perience, perceptual defense and vigilance 
occur in the presence of varied and competing 
stimuli. They name two kinds of imbalance 
among a set of competing perceptual responses. 
If one perceptual response is punished while 
competing perceptual responses are instru- 
mental to the avoidance of punishment and a 
reduction in anxiety, perceptual defense should 
be learned, since competing perceptual re- 
sponses should tend to crowd out the critical 
perceptual response. On the other hand, 
perceptual vigilance should be learned when 
one perceptual response is instrumental to 
avoidance of punishment while competing 
perceptual responses are punished. Specifically, 
the following hypotheses are tested: (a) When 
one perceptual response is followed by punish- 
ment and competing perceptual responses are 
instrumental to avoidance of punishment, the 
punished response becomes weaker as com- 
pared with the competing perceptual responses; 
(b) When one perceptual response is instru- 
mental to avoidance of punishment and com- 
peting perceptual responses are punished, the 
avoidance response becomes Stronger as 
compared with the competing perceptual 
responses. 


METHOD 
Subjects 


The Ss were 32 undergraduates at the University of 
Michigan, evenly divided between the defense and 
vigilance training procedures. They were randomly 
assigned to the two procedures, the only requirement 
being that an equal number of men and women partic- 
ipate in each. All Ss were volunteers, and none was 
aware of the purpose of the experiment. 


Stimulus Materials 


Four rather meaningless, emotionally neutral figures— 
a circle, diamond, square, and triangle—were cut from 
black Munsell paper and mounted on white cards. The 
figures were arranged top, bottom, left, and right, on 
each of four cards, so that each figure appeared in all 
four positions. Hence, if each of the four cards was 
presented to the S$ an equal number of times, each 
figure would appear an equal number of times in each 
of the four positions. In this way, positional (see 10) 
and frequency effects were controlled. Prior to this 
experiment, the figures had been roughly equated for 
discriminability by Thomas Pustell 


Apparatus 


A Gerbrands tachistoscope, designed for individual 
use, was adjusted for a constant exposure interval of 
-12 sec. throughout all experimental series. With a 
pre-exposure field of 6.29 ft, lamberts, an exposure 
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field of 0.425 ft. lamberts, and geometrical forms of 
less than one inch diameter, this exposure time pro- 
duced a level of awareness too low for the figures to be 
readily named. 

The electric shock applicator contained four pairs of 
electrodes spaced about an inch apart and was designed 
to be strapped to the leg of the S just above the ankle. 
In order to discourage adaptation and at the same time 
Prevent burns, the shock could be sent through first 
one pair of electrodes and then another from trial to 
trial. The shock circuits and applicator were designed 
by Dr. Carl Brown. 


Defense Training Experiment 


Level of awareness series. The experimental 
session began for each S with an assessment of 
his level of awareness, A particular exposure 
duration, together with illumination level, does 
not define a single level of awareness for more 
than one S. To ascertain whether level of 
awareness was related to the amount and 
character of the experimental effect, the S was 
given a simple discrimination task. A forced- 
choice technique developed by Blackwell (1) 
and modified by Blum (2) was selected for its 
sensitivity in measuring discrimination without 
awareness. It has the additional value of 
permitting a number of stimuli to be presented 
at once, a condition that more truly represents 
most perceptual situations, 

The S was informed that he would be shown 
a number of cards, but that they would be 
flashed too briefly for him to see much. On each 
trial, Æ asked S to “Find the square” or “Find 
the diamond,” etc. The S responded with 
“Top,” “Bottom,” “Left,” “Right,” depending 
on where he thought he saw the figure. He was 
Instructed on each trial to focus on the fixaticn 
point in the center of the field. Sixty-four 
trials were Tun, and S’s level of awareness was 
measured by his percentage of correct 
identifications, 

Baseline series. Although the figures were 
roughly equal in recognizability for a group 
collectively, individuals find one figure of 
another more salient because of their peculiar 
perceptual histories and predilections. For 
this reason an empirical baseline of recogniz® 
bility of each figure was obtained for every Í 
he procedure for this series was like that ° 
the Preceding in every way except that the 
Was Instructed to “tell me on each trial whic? 

gure is clearest—which one is most recs” 
Ges He was again asked to eee ft ” 
olce i “ 2 a sa. «Leit, 
or “Right and te Pai m: henever 
At, ollow his hunch w. 
uncertain, Any experimental effect produce 
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by the following training procedure should reg- 
ister as a departure from this empirical base- 
line. 

Training series. The defense training pro- 
cedure was essentially like the baseline series, 
with the important difference that reinforce- 
ment was introduced. One of the geometrical 
figures, selected arbitrarily, was designated the 
critical stimulus. Each of the figures was the 
critical one for four of the 16 Ss. As in the 
preceding series, S was instructed to report on 
€ach trial which of the four figures was most 
recognizable by indicating its position. When 
the critical stimulus was selected, an electric 
shock followed two seconds after exposure of 
the figures and lasted another two seconds. 
Selection of one of the other three figures as 
Most recognizable resulted in avoidance of the 
shock. 

Circuits were designed so as to permit S’s 

Pressing of one of four buttons to reveal his 
choice of figures and control presentation of 
the shock. The critical stimulus, of course, 
varied in position from trial to trial, and there- 
fore no single one of the four responses was 
always punished. The Æ noted the position of 
the critical stimulus before each trial, and 
adjusted a commutator, thereby determining 
which of the four buttons would activate the 
shock circuit. 
_ For the purposes of this experiment it was 
imperative that the shock be punishing. It was 
necessary that it be decidedly unpleasant, even 
painful for best results, but short of incapac- 
itating. Since individual pain thresholds for 
electric shock vary so widely, the intensity of 
shock was set at a subjective level for each S. 
After he had been assured that the shock was 
quite harmless and encouraged to takeas much 
as he could stand, the shock was set at the 
level each S judged to be “just this side of 
beyond endurance.” By phenomenological 
concensus of 32 Ss, the shock was decidedly 
unpleasant, even painful, and like the jab of a 
hot needle. 

Sixty-four trials were run, approximately 
one every 10 seconds. After each block of 16 
trials, S rested briefly. 

‘Assessment series. Immediately after the 
training series, S was allowed to remove the 
shock applicator and was assured that there 
would be no more shock. The instructions and 
Procedure for the assessment series were 
identical with those for the baseline series. The 
Same cards were presented for 64 trials, and on 
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each trial S indicated which figure was most 
recognizable. Finally, he was questioned for any 
knowledge of the rule by which he had been 
shocked. 
Vigilance Training Experiment 

The vigilance training experiment repeated 
the defense training experiment with one 
important difference: the reinforcement pre- 
scription was reversed. When S selected one of 
the three noncritical stimuli as most recog- 
nizable, he was shocked; when he selected the 
critical stimulus, he avoided the shock. 


RESULTS AND DISCUSSION 


Relative perceptual response strength for 
each figure was measured by the frequency 
with which that figure was judged clearer than 
competing stimuli. The Ss of the defense 
training procedure should, if the first hypoth- 
esis is correct, select the critical figure less 
frequently in the assessment series, after train- 
ing, than they- did in the baseline series. 
Amount of shift in frequency of response 
ranged from 0 to —14 with a mean of —5.8. 
In order to evaluate the group shift, it was 
noted that 14 of the 16 Ss shifted in the pre- 
dicted direction while two remained unchanged. 
By the sign test this shift is significant with a 
p < .012 These findings confirm Hypothesis I. 

Hypothesis II leads to the contrary expecta- 
tion that Ss of the vigilance training procedure 
should select the critical stimulus more fre- 
quently during the assessment series than 
during the baseline series. Magnitude of shift 
ranged from —7 to +36 with a mean of +7.0. 
Thirteen of 16 Ss shifted in the anticipated 
direction, and this shift is significant by the 
sign test with a p< .025. Thus the second 
hypothesis is supported. 

We would also like to know what progressive 
changes take place throughout the training 
series, whether these changes follow a typical 
learning curve, and whether there is any sign of 
extinction in the assessment series. Conse- 
quently, pooled frequency of selection of the 
critical stimulus was graphed as a function of 
trials. 

For the defense training group (see Fig. 1), 
frequency of selection drops precipitously from 
the baseline to the early part of the training 
series. Apparently the learning takes place very 
rapidly, although the abruptness of the drop 


2Since the direction of experimental shift was pre- 
dicted, one-tail p values are reported. 
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BASELINE SERIES 


O— — —O VIGILANCE GROUP 
@©————-@ DEFENSE GROUP 


PERCENT FREQUENCY OF SELECTION 
OF THE CRITICAL STIMULUS 
pS 
[e] 


8 I6 24 32 40 48 56 64 8 


SUCCESSIVE EIGHT TRIAL 
Fic. 1. MEAN PERFORMANCE OF THE DEFENSE AND VIGILANCE G 
SERIES 


may be partly due to a chance increase in 
frequency toward the end of the baseline series, 
In the vigilance training series, frequency of 
selection increases throughout the training 
series in a manner strongly Suggestive of a 
learning curve. Since an anxiety reaction is 
very probably conditioned early in both training 
series, or even earlier, when the shock is 
adjusted to the appropriate subjective level, 
there is some reason to expect the instrumental 
learning of defense to proceed more rapidly 
than the learning of vigilance. Theoretically, 
vigilance is acquired as a single, critical, per- 
ceptual response, strengthened by anxiety 
reduction attendant upon shock avoidance; 
this is the kind of learning that ought to be 
describable by a familiar learning curve. The 
learning of perceptual defense, however, pro- 
ceeds as the three competing perceptual re- 
sponses are strengthened, by anxiety reduction, 
at the expense of the critical perceptual re- 
sponse. There is less basis for anticipating the 
function for this kind of learning. That the 
learning of defense did occur more rapidly than 
the learning of vigilance might be explained by 
the greater probability of reinforcement in the 
defense series. It is also likely that the greater 
number of shocks in the vigilance series 
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ASSESSMENT SERIES 


INTERVALS 
Rours THROUGHOUT EACH EXPERIMENTAL 


produced a higher level of anxiety in the 
vigilance group. Conceivably, level of anxiety 
drive could be related to the rapidity and 
course of learning. Finally, for neither pro~ 
cedure is there any evidence of extinction 10 
the assessment series; and in view of the well- 
documented resistance to extinction of avor 
ance responses (15, 17) this isnot too surprising: 
A problem of interpretation arises from the 
fact that the empirical baseline values diffe" 
somewhat from their expected value of 257" 
31.6% for the defense group and 23.0% for the 
vigilance group. While there is no reason@ 
alternative but to attribute this disparity y 
sampling fluctuation, it is important to A 
tinguish two possible sources of error. Hav? 
two deviant samples been drawn from 
population of responses that mark the baselin® 
Or have such deviant samples been draw 
from a population of Ss varying in percept e 
sensitivity to the critical stimuli? In m 
former case, we should expect some regressio 
toward the value of 25% in the assessm i 
series, quite apart from any effect of the an 
ing procedures. If the latter possibility 1 ex 
case, then there is no problem, and the fot 
perimental effect is the more impressive is 
having overridden figure preferences. 
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possible, for purposes of analysis, to eliminate 
those four Ss with the highest frequencies at 
the defense baseline and the three Ss lowest at 
the vigilance baseline, leaving the two baseline 
Means approximately equal. Of the 12 Ss 
remaining in the defense group, 10 selected 
the critical stimulus less frequently during the 
assessment series than at the baseline. By the 
sign test this shift is significant with p < .025. 
Ten of the 13 Ss remaining in the vigilance 
group shifted in the predicted direction 
(p < .05). It seems clear that the regression 
hypothesis cannot account for these results. 

How justified is the interpretation that these 
training procedures have changed the strength 
of perceptual responses rather than S’s disposi- 
tion to make a report that has invariably 
brought him a painful electric shock? If S at 
any time discerned a connection between 
pressing the button for a square and being 
shocked, it would be only prudent for him to 
take a good look on each trial, then press one 
of the other buttons. For a number of reasons 
this suggested process cannot account for 
these results. Only two of 32 Ss could name the 
critical figure or verbalize the rule by which 
they had been shocked. Typical of their 
replies when questioned were, “Maybe there 
was a pattern to it, but I don’t know what it 
was” and “I didn’t know what I did or saw 
would have anything to do with it.” This is, in 
fact, learning without awareness. Moreover, if 
the S were deliberately falsifying his report, 
then during the assessment series, after the 
shock applicator had been removed, he should 
have abandoned this tactic; frequency of 
selection of the critical stimulus would have 
returned to what it was at the baseline. It is 
evident from the data that this did not happen. 
Furthermore, the greater his awareness, the 
more information he has to act on and the 
more successful he should be; yet no relation 
was found between level of awareness and 
magnitude of experimental effect. 

Tf the theoretical assumptions of the study 
are correct, something like the following oc- 
curred. An anxiety reaction produced by the 
shock was conditioned to the stimulus figures. 
It seems likely that anxiety attached to all 
four figures rather than just to the critical 
figure because of their low discriminability at a 
low level of awareness. Upon presentation of 
the stimulus, S tensed, gasped, or wrapped a 
leg around the chair, regardless of which figure 
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was selected. The total conditioned stimulus 
was probably even more complex, consisting of 
a stimulus presentation plus a lapse of time of 
two seconds. Generally, anxiety reduction is 
assumed to follow upon withdrawal of the cues 
eliciting the anxiety. However, termination 
of the exposure period brought no visible end 
to the S’s malaise. It was not until two seconds 
had elapsed and no shock had occurred that 
signs of relaxation or a relieved and grateful 
sigh appeared. In the absence of any good meas- 
ure of momentary changes in the anxiety level 
it is reasonable to think anxiety drive reduction 
and reinforcement of the prior perceptual 
response occurred at this point. 

This interpretation, which holds that the 
punished response is “weakened” only because 
competing responses are strengthened is 
advanced because it has worked well in describ- 
ing the avoidance learning of skeletal responses 
(9, 16). Logically, of course, the shift in relative 
perceptibility of these figures could be credited 
either to sensitization to the nonpunished 
stimuli or desensitization to the punished 
stimuli, or both. The data prove only that at 
least one of these reactions must have occurred. 
This ambiguity is shared, however, with 
studies in which a single stimulus is presented 
by the ascending method of limits. In studies 
following the latter design, the assessed 
strength of the veridical percept is relative to 
the strength of competing perceptual hypoth- 
eses (12). The forced-choice procedure simply 
delimits the range of competing perceptual 
responses to a high priority few. Until the 
evidence for sensitization and desensitization is 
less equivocal, perceptual vigilance and defense 
can legitimately be identified only with shifts 
in balance among competing perceptual 
responses, in the one case toward a critical 
percept, and in the other case against it. 

This study supports the view that perceptual 
defense and vigilance are learned reactions to 
anxiety arousing stimuli. It is consistent with 
Postman’s recent statement that “the facts 
which give rise to this concept [of perceptual 
defense] can better be subsumed under other, 
more general, principles of perception” (12, p. 
298); the same could be said for vigilance. 
Both concepts, it is suggested, may be sub- 
sumed under more general principles of per- 
ceptual learning, but this statement is not to be 
construed as asserting that perceptual defense 
and vigilance are therefore explained away. 
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Though the mechanisms appear to follow more 
general principles of learning, they nevertheless 
retain some identity as mechanisms. 


SUMMARY 


This study experimentally produces the 
learning of perceptual defense and vigilance 
and proposes a behavior theory analysis of the 
learning process. According to this analysis, 
perceptual defense is learned when the per- 
ceptual response to a threatening stimulus is 
punished and competing perceptual responses 
are instrumental to anxiety reduction. Com- 
peting perceptual responses when reinforced 
are strengthened at the expense of the critical 
perceptual response. Perceptual vigilance is 
learned when the perceptual response to a 
threatening stimulus is reinforced by anxiety 
reduction and competing perceptual responses 
are punished. 

Avoidance learning procedures were em- 
ployed to change the comparative recog- 
nizability of four geometrical figures presented 
simultaneously below the threshold for aware- 
ness. Sixteen Ss underwent defense training, 
another 16 vigilance training. Before each 
training series, the comparative recogniza- 
bility of the four figures was assessed for each S$ 
by a forced-choice technique. In both training 
series, S reported on each trial the position of 
the figure he found clearest. During defense 
training, if he selected the critical figure, he 
was shocked. If he selected one of the other 
three figures, he avoided the shock. During 
vigilance training, selection of the critical 
figure was instrumental to shock avoidance 
while selection of any of the other three figures 
incurred the shock. After each training pro- 
cedure, comparative recognizability of the 
figures was assessed again. Fourteen of 16 
defense Ss found the critical figure less recog- 
nizable after training than before. Thirteen of 
16 vigilance Ss found the critical figure more 

recognizable than before training. Learning in 
both cases proceeded in the absence of 
awareness. 
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THE RELATIVE AVERSIVENESS OF WARNING SIGNAL AND 
SHOCK IN AN AVOIDANCE SITUATION 


MURRAY SIDMAN anp JOHN J. BOREN! 
Walter Reed Army Institute of Research 


T 1s a widespread clinical and personal 
observation that the anticipation of pain 
may be worse than the pain itself. Yet the 

phenomenon has received little experimental 
attention. Possibly relevant observations are 
those of Gwinn (2) and Solomon, Kamin, and 
Wynne (18), who found that extinction of 
avoidance behavior was not hastened when the 
avoidance response was punished by shock. 
Kamin (4) demonstrated that a response which 
terminates a warning stimulus can be condi- 
tioned even when the response is regularly 
shocked. Escape from the warning stimuli was 
possibly sufficiently reinforcing to counteract 
the effects of punishment. Hill, Kornetsky, 
Flanary, and Wikler (3) found that shock 
intensities were overestimated by subjects 
who were not familiarized and reassured with 
respect to “the potentially fear-inspiring 
experimental situation” prior to the 
experiment. 

A more direct experimental demonstration 
would consist of an arrangement that permits 
objective measurement of the subject’s prefer- 
ence, either for a noxious event or for an 
anticipatory situation prior to the noxious 
event. The present study demonstrates one 
such arrangement in the framework of a 
modified avoidance conditioning procedure. 
In the classical avoidance experiment, the 
avoidance response prevents a shock and 
terminates the anticipatory warning signal. 
Shock avoidance and escape from the antic- 
ipatory situation are tied together. In the 
present modification of the classical technique, 
the subject can behave in either of two alter- 
native fashions: (a) postpone the shock, while 
at the same time maintaining the warning 
signal or (b) take the shock while simulta- 
neously terminating the warning signal. The 
shock can be avoided only at the cost of pro- 
longing the conditions that require the antic- 
ipatory avoidance behavior. The anticipatory 
situation can be terminated only at the cost of 
receiving a shock. 


1 Now at the Merck Institute for Therapeutic Re- 
Search, 


EXPERMENT I 


Subjects and Apparatus 


The Ss were 4 male albino rats. The lever-pressing 
apparatus, with its associated automatic programming 
and recording devices, has been described previously 
(12, 14). In brief, the experimental chamber was a 
metal box with a grid floor and a sloping plexiglass 
ceiling. Depression of a lever, which was inserted 
through one wall of the chamber, constituted the 
measured avoidance response. A small neon pilot light 
was employed as the stimulus. The behavior was 
recorded on magnetic counters and cumulative re- 
corders. 


Procedure 


Avoidance conditioning. The avoidance response 
(lever-pressing) was first conditioned by means of a 
previously described “nondiscriminated” avoidance 
procedure (11). Brief shocks, of a fixed duration (ap- 
proximately 0.3 sec.), were delivered through the grid 
floor every 20 seconds. Each time the animal depressed 
the lever, however, the shock delivery was postponed 
an additional 20 seconds, measured from the lever 
depression. Only the downward movement of the lever 
postponed the shock; continued holding of the lever 
did not affect the time of shock delivery. 

The interval by which each response delays the 
shock has been termed the response-shock (RS) inter- 
val, A response-shock interval of 20 seconds, with no 
stimulus light present, is referred to as (RS20”)P. The 
superscript “D” indicates the absence of the stimulus 
light. 

Chained avoidance contingencies under stimulus 
conirol. After the rate of avoidance responding became 
reasonably stable on the shock schedule, (RS20”)P, 
the following procedure was instituted. If the animal 
permitted 15 seconds to elapse without a lever-pressing 
response, a stimulus light (L) would come on. Every 
lever press in the absence of the stimulus, however. 
postponed the onset of L for an additional 15 seconds, 
Tn the absence of L, then, there is a response-light (RL) 
interval instead of a response-shock (RS) interval. 
This is referred to as (RL15 ’)P, to indicate that each 
response in the absence of the light postponed the 
onset of the light for 15 seconds. 

When the animal permitted the light to come on, a 
different schedule went into effect. If there was no 
response in the presence of the light a shock occurred 
within five seconds and the light terminated, rein- 
stating (RL15”)P. Each response in the light, however, 
postponed the shock for five seconds. The light would 
remain on until the animal permitted the shock to 
occur. Thus, in the presence of the light there is an RS 
interval of five seconds, which may be denoted as 


(Rss): 
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Fic. 1. DIAGRAMMATIC REPRESENTATION OF THE 
EXPERIMENTAL PROCEDURE (SEE 
Text FOR Deras) 


The total procedure may, for convenience, be 
represented symbolically as: (RL15”)P — 
(RS5”)t, S— (RL15”)P . . . . This may be read 
as follows: “In the absence of the light, the 
response-light interval is 15 seconds. If the 
animal permits the RL interval to elapse this 
produces the light, in the presence of which a 
response shock interval of 5 seconds is in force. 
Occurrence of a shock terminates the light and 
reinstates the response-light interval of 15 
seconds.” Fig. 1 depicts the schedule, 
(RL10”)P — (RS5”)t, S > (RL10”)P. (In the 
present experiment, the RL interval was 15 
sec.) 

Each experimental session lasted six to eight 
hours, with a given animal running no more 
frequently than once every alternate day. 
Responses were recorded separately in the 
presence and in the absence of the light. 


RESULTS 


Results for the four Ss are presented in Fig. 
2. The control curves in the left-hand frames 
represent the average response rates during the 
final sessions of avoidance conditioning, in 
which the prevailing shock schedule was 
(RS20”)P. The curves in the right-hand frames 
represent the rate of responding in the light 
and in the absence of the light following the 
change in procedure. In three of the four cases 
the response rate in the stimulus light displayed 
an initial increase over the rate in (RS20”)p, 
but in all cases the rate gradually dropped to a 
level well below the control rate. The response 
rate in the absence of the light did not reveal 
any marked trend from session to session that 
was consistent among all the Ss, but, in all 
cases, ended up higher than the rate during the 
warning stimulus. 


DISCUSSION 


On the basis of previous findings, we might 
have expected a high rate of avoidance re- 
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Fic, 2. RATE OF AVOIDANCE RESPONDING IN THE LIGHT 
(L) AND IN THE ABSENCE OF THE 
Licut (D) FOR 4 ANMALS 


The curves on the left are for the nondiscriminated 
avoidance procedure, and the curves on the right are 
for the chaining procedure. Session 1 on the chaining 
procedure followed immediately after the last session 
on the nondiscriminated procedure. 


sponding in the presence of the warning 
stimulus and a low rate in its absence. This 
outcome would have been consonant with the 
findings of Mowrer and Lamoreaux (8, 9) ™ 
the classical avoidance situation and with 
those of Sidman in a situation in which the 
avoidance responses could postpone the wart 
ing stimulus (14). It has also been show? 
previously (6, 12, 13) that the rate of avoidan©® 
responding is an inverse function of phe 
response-shock interval, within certain limits: 
On this basis alone, a higher response T° 
during the warning stimulus would be P™ 
dicted, when the response-shock interval Ka 
only five seconds, than during the coit 
phase, when the response-shock interval W 
20 seconds. w- 
In none of the previous experiments, ho 
ever, was the S faced with the dilemm 


WARNING SIGNAL AND SHOCK 


choosing between the warning stimulus and 
the shock. The response that avoided the 
shock was also responsible for termination of 
the warning signal. The present case, in which 
such a choice was required, produced behavior 
whose pattern was diametrically opposite to 
that which is customarily observed. The rate 
of avoidance responding was low in the presence 
of the warning stimulus and high in its absence. 
This was the pattern to be expected if the 
shock was preferable to the signal. For while 
lever pressing during the warning stimulus did 
succeed in postponing the shock, it also 
served to prolong the stimulus. Behavior other 
than lever pressing, however, was the only 
means of terminating the warning stimulus. 
The low rate of lever pressing indicates that 
other behavior did, indeed, become prepotent, 
even though an inevitable shock was one of the 
consequences of such behavior. 


EXPERMENT II 


If the decline in the rate of lever-pressing 
during the stimulus was due to the reinforce- 
ment of competing behavior through the 
termination of the stimulus, it should be pos- 
sible to restore the lever-pressing rate by 
terminating the light independently of any 
behavior. This was accomplished by permitting 
the stimulus light to remain on for five minutes, 
regardless of whether or not the animal pressed 
the lever. 


Subjects and Apparatus 


The Ss were five male albino rats. The apparatus 
was the same as in Experiment 1, except for the shock 
power supply. In this experiment an Applegate shock 
generator set at maximum output was used. At this 
setting, the shock varied from 3 to 5 ma. 


Procedure 


Shock terminates stimulus. This procedure was the 
same as the “chained avoidance contingencies under 
stimulus control” procedure described in Experiment 
J. The salient feature here is the termination of the 
stimulus coincidentally with the delivery of the shock. 
‘All animals were run on the schedule, (RL20’)? — 
(RS10")", S > (RL20”)P ... for a total of 100 hours. 
Rat AA-1 was also run on the schedule, (RL20”)? > 
(Rs20")4, S > (RL20")2...for 100 hours. Rat 
GF-81 was also run for only one session on the schedule, 
(RL20")2 — (RS5”)*, S > (RL20”)P. . . . Unfor- 
tunately, this animal died on the day following this 
session, but his data were included with the others. 

Five-minute stimulus duration. On this procedure, 
whenever the animal permitted the stimulus to appear 
it remained on for five minutes, regardless of the number 
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of responses emitted or the number of shocks delivered. 
Termination of the stimulus was based upon the passage 
of time alone, and was independent of the animal’s 
behavior. In terms of our symbolic notation, this pro- 
cedure might be expressed as (RL20”)P + (RS10")¥, 
(5-min.)! — (RL20")?. . . . 


The RL and RS intervals for each animal 
were the same as those described above in the 
“shock terminates stimulus” procedure. Every 
animal was run for 100 hours on each schedule. 
On the procedure involving (RL20’)> — 
(RS10”)#, rats GF-81 and AA-1 ran first on the 
“shock terminates stimulus” procedure, after 
which they were changed to the “five-minute 
stimulus duration” procedure. All the other 
animals, along with GF-81 and AA-1 on the 
procedures that involved (RSS5’)" and 
(RS20”)4, respectively, ran on the “‘five- 
minute stimulus duration” procedure first. 


RESULTS 


Fig. 3 and Fig. 4 give a session-by-session 
picture of the response rates under both pro- 
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Fic. 5. RATE or ÅVOIDANCE RESPONDING IN 
THE PRESENCE OF THE LIGHT WHEN THE 
LIGHT TERMINATES WITH THE SHOCK 

AND WHEN THE DURATION OF 
THE LIGHT Is INDEPENDENT 
OF THE SHOCK 


The RL interval is 20 sec. in all cases, 


cedures for two of the animals. Rat AA-1, 
which started on the “shock terminates 
stimulus” procedure, replicated the data pre- 
sented in Experiment I. The response rate in 
the presence of the stimulus fell off relative to 
its own initial level and to the rate in the 
absence of the stimulus. When the stimulus 
duration was fixed at five minutes and was, 
therefore, independent of the animal’s þe- 
havior, there was a marked increase in the rate 
of responding during the stimulus. Rat AC-39, 
started on the “five-minute stimulus duration” 
procedure, showed essentially the same results, 
When the stimulus termination was made 
dependent upon failure to press the lever and 
receipt of a shock, the response rate in the 
stimulus dropped markedly. In both animals 
there was only a slight change in response rate 
in the absence of the stimulus. 

A summary of the data for all animals is 
presented in Fig. 5. This figure shows the 
median response rate in the stimulus over the 
last four sessions of each procedure for each 
animal. In all cases the RL interval was 20 
seconds. The RS intervals employed are in- 
dicated beneath the corresponding bars of the 
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graph. Of the five cases in which the RS 
interval was 10 seconds, the rate during the 
stimulus was higher when the stimulus duration 
was independent of the animal’s behavior. In 
the case of GF-83, however, the difference was 
too small to be taken seriously, and this animal 
must be considered an exception. 

When rat GF-81 was run on an RS interval 
of five seconds there was again a striking differ- 
ence in rate between the two procedures. 
must be remembered, however, that GF-81 
had only one session on (RS5”)t and that the 
tates depicted here are undoubtedly not the 
final levels that would have been reached } 
additional sessions could have been run. 
Most interesting finding is that yielded by Rat 
AA-1 when the RL and RS intervals were both 
20 seconds. In this case, it made little differenc? 
whether or not the stimulus duration depende 
upon the S’s behavior. Response rates in thé 
Presence of the stimulus were essentially th 
same in both procedures, 

The effect of the procedural differences upo? 
the response rate in the absence of the stimulu 
1s shown in Fig. 6. With one exception, the 1% 
on the five-minute stimulus duration proce 
was higher, but in no case were the differen 
for a given animal as great as those display’ e 
by the response rates in the presence 0 | p 
stimulus depicted in Fig. 5. The one except? 
GF-81 at an RS interval of five gecir al 
Probably does not represent a true stable le 
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of responding because only one session was 
included. 


Discussion 


These data substantiate the view that the 
decline in lever-pressing rates during the 
Stimulus, observed in Experiments I and II, 
is due to the reinforcement of nonlever-pressing 
behavior by the termination of the stimulus; 
this, in spite of the fact that a shock accom- 
banied stimulus termination. When thestimulus 
duration was fixed at five minutes, regardless of 
the animal’s behavior, the response rate in the 
presence of the stimulus increased. In this pro- 
cedure it was impossible for the animal to “take 
the shock in order to terminate the stimulus,” 
and the lever-pressing rate assumed a value 
consistent with the prevailing response-shock 
interval. 

Most current explanations of discriminated 
avoidance behavior attribute the maintenance 
of the avoidance response to a “drive” or 

aversive” function acquired by the warning 
stimulus as a result of its temporal relation to 
the shock (e.g., 5, 7, 10, 17). Termination of the 
stimulus is held to provide reinforcement for 
the avoidance response. The present data seem 
to indicate that the stimulus-shock pairings 
actually make the warning stimulus more 
aversive than the shock from which it derived 
its function. But the data of Rat AA-1 do not 
support the suggestion that the aversive func- 
tion of the warning stimulus derives simply 
from its association with the shock. Rather 
than dismiss the performance of AA-1 as an ex- 
ample of unexplained variability, we may view 
the apparent exception with greater than ordi- 
nary interest. Rat AA-1 displayed the expected 
difference in responding under the two pro- 
cedures when the RL interval was 20 seconds 
and the RS interval was 10 seconds. There was 
no large difference, however, when both 
intervals were equal. In spite of the fact that 
the warning stimulus was inevitably paired 
with shock, there is little indication that the 
stimulus was more aversive than the shock 
when RL and RS intervals were each 20 sec- 
onds. Some factor in addition to the stimulus- 
shock pairings must be involved in determining 
the relative aversiveness of stimulus and shock. 

Sidman and Boren have suggested (16) that 
the discriminated avoidance situation may be 
considered as a “multiple schedule” (1) in 
which one set of avoidance contingencies pre- 
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vails in the presence of the warning stimulus 
and another in its absence. In the present ex- 
periments, the two sets of avoidance contin- 
gencies may be specified in terms of the RL and 
RS intervals, the latter prevailing in the pres- 
ence of the warning stimulus, and the former in 
its absence. When both intervals were 20 sec- 
onds, there was no great advantage to the ani- 
mal in taking the shock and terminating the 
stimulus, for the temporal contingencies were 
the same in the presence and absence of the 
stimulus. A relatively high rate of lever press- 
ing was therefore maintained in the stimulus, 
and the shift to a fixed stimulus duration pro- 
duced no appreciable change. (The fact that 
this situation obtained when both intervals 
were numerically equal may be regarded as 
fortuitous. It is possible that for other animals 
a much larger difference between the RL and 
RS intervals would be required to produce a 
decline in rate during the stimulus.) Other ex- 
periments have served to confirm this explana- 
tion. It was found that the aversiveness of the 
warning signal is a function of the RS interval 
that prevails in the presence of the signal, of the 
RL interval that prevails in its absence, and of 
the difference between the two (15). It is neces- 
sary to take into account not only the antic- 
ipatory situation from which the S escapes, 
but also the conditions that obtain subsequent 
to the escape. If the latter are relatively 
favorable, the anticipatory signal may actually 
become more aversive than the shock itself. 


SUMMARY 


An avoidance situation was arranged in 
which the animals could either (a) postpone the 
shock and prolong the warning signal or (b) 
take the shock and terminate the warning 
signal. With this procedure, we observed a low 
rate of avoidance responding in the presence of 
the warning signal and a high rate in its 
absence. The animals behaved in such a way 
as to terminate the stimulus as quickly as pos- 
sible, even though a shock accompanied each 
stimulus termination. When the signal duration 
was made independent of the animal’s be- 
havior, the rate of avoidance responding in the 
stimulus increased. It was suggested that the 
avoidance contingencies in the presence and 
absence of the warning stimulus are important 
determiners of the relative aversiveness of 


stimulus and shock. 
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SELECTIVE RECALL IN SCHIZOPHRENIA AND ITS 
RELATION TO EGO STRENGTH: 


ARTHUR S. TAMKIN? 
Duke University 


HE PURPOSE of this study is to investi- 

gate the nature of selective recall in 

schizophrenic patients. The hypotheses 
offered are that schizophrenics manifest a dis- 
tinctive type of memory process in contrast to 
normal subjects, resulting in the preferential 
remembering of experiences connoting personal 
failure and diminished self-esteem, and that 
this process may be related to deficient ego 
strength. 
_ This characteristic memory process may be 
inferred both from clinical observations of 
Schizophrenics and from theoretical considera- 
tions of schizophrenia. Certain aspects of its 
symptomatology, suggesting selective memory 
functioning, have been described as brooding 
Preoccupations (4), chronic feelings of inse- 
curity and inadequacy (20), and reaction 
sensitivity to events which reduce self-esteem 
(6). Cameron has explained this symptoma- 
tology as follows: “In the childhood of both 
paranoid and schizophrenic patients . - - there 
have been developed self attitudes which leave 
the individual unusually reaction sensitive to 
signs that he may be guilty, inferior, or un- 
worthy in the eyes of others” (5, p. 489). 

Although in clinical descriptions and theoret- 
ical discussions pathological selective remem- 
bering has been designated by different terms, 
the common implication is that the direction 
of thinking and remembering in the schizo- 
phrenic is determined by his peculiar feelings, 
needs, and emotional complexes. It seems that 
he does not passively absorb impinging stim- 
uli and recall his experiences indiscriminately ; 
rather, directive intrapsychic forces predispose 
him to respond to and retain material which is 
congruent with his pathological motivations. 
In normal functioning, selectivity in both 


1 This paper is based on a Ph.D. dissertation at 
Duke University. The author wishes to thank Drs. 
Louis D. Cohen, George H. Collier, and Gregory 
Kimble for their invaluable assistance on this project, 
and Dr. Burke M. Smith of the Roanoke Veterans 
Administration Hospital who made available patients 
and facilities. 

2Now at the Veterans Administration Hospital, 
Northampton, Mass. 


perception and memory has been demonstrated 

by a variety of experimental methods, and it 

has been related to basic needs and defensive 

mechanisms of the personality. Personal needs 

and values, for example, have been shown to 

enhance the perceptual recognition of need- 

and value-related words (14), and the prefer- 

ential perception of tasks associated with 

failure and success has been shown to be de- 

termined by individual defensive mechanisms 

(9, 15). Studies of selectivity in memory are 

represented by a large body of literature re- 

viewed in several publications (3, 16). The 

general findings which have emerged are that 
memory normally functions in a selective 
manner, enhancing retention of material with 
which the person is in agreement (7), which is 
congruent with his value-orientation (8), and 
which has pleasant affective connotations (12). 

A method for measuring selective recall 

which has enjoyed wide use is the recall of 
completed and incompleted tasks, introduced 
by Zeigarnik (22) in 1927. This method is 
quite suitable for the purposes of this investi- 
gation, for when it is applied as a simulated 
intelligence test in order to enhance the ego 
involvement of the subjects, the experimenter 
may systematically manipulate the subjects’ 
experience of self-esteem through success and 
failure on the tasks. Moreover, motives related 
to the defense of the ego or other psychologi- 
cally significant adaptive responses may be 
evoked. The method of completed and incom- 
pleted tasks creates a realistic situation which 
maintains the subjects’ interest by simulating 
a familiar kind of competitive situation, and is, 
therefore, a convincing experimental analogue 
of an emotionally arousing experience which 
has real personal significance for the partici- 
pants. Alper (2) has demonstrated that when 
the completed and incompleted task method is 
used under ego-involving conditions, individ- 
uals who recall a preponderance of successes in 
this threatening situation are characterized by 
a “strong ego,” whereas those who recall a 
preponderance of failures have a “weak ego.” 
Several subsequent studies have shown that 
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normally the completed tasks are selectively 
retained (1, 11, 17), and others have confirmed 
the relationship between this tendency and 
level of ego strength (9, 10, 13). 

When completed and incompleted tasks are 
presented to schizophrenics, one may hypothe- 
size (a) preferential recall of the incompleted 
puzzles and (b) a relation between this tend- 
ency and deficient ego strength. These specific 
hypotheses were tested by the experiment 
reported here. 


METHOD 


In order to test the hypothesis that schizo- 

phrenics manifest selective recall of failure ex- 
periences, their recall of completed and 
incompleted puzzles which were administered 
ostensibly as an intelligence test was analyzed. 
An incidental rather than an intentional (set- 
to-learn) learning situation was utilized be- 
cause of its superiority for the demonstration 
of selective remembering (1). The condition of 
ego involvement, created by simulated intelli- 
gence testing, was established in order to 
enhance the probability that incompleted 
puzzles would connote failure, thus presenting 
a threat to self-esteem. The second hypothesis, 
which states that the selective recall of schizo. 
phrenics is related to deficient ego strength, 
was explored with the Scott-Duke Question- 
naire (18), designed to measure the trait of 
self-acceptance, a personality characteristic 
presumably related to level of ego strength. 
An inverse relation between scores on the 
Self- acceptance scale of this questionnaire and 
a preponderant recall of incompleted puzzles 
would be considered confirmative of the second 
hypothesis. 


Subjects 


An experimental and a control group of 24 Ss each 
were used in the experiment. The experimental group 
was composed of male schizophrenics, patients at the 
Veterans Administration Hospital, Roanoke, Virginia, 
randomly selected on the basis of availability for 
psychological testing and ability to cooperate on the 
testing procedures. Four Ss had to be eliminated be- 
cause of failure to cooperate. The diagnoses were made 
by the psychiatric staff. All patients used were recent 
admissions who had received no psychiatric treatment 
during their current hospitalization at the time of the 
experiment. The control group consisted of 24 adult 
males who had never received psychiatric treatment 
and who claimed that they did not require such treat- 
ment. They were considered to represent a group of 
normal individuals. They were selected according to 
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TABLE 1 
VARIABLES ON WHICH THE SCHIZOPHRENIC 
AND CONTROL GROUPS WERE MATCHED 


hizophrenics ontrols 
Viriibie Schizophrenic: Ci ttt 
N Mean N Mean 
Education = 892 — 925 A 
Age — 30.08 — 29.75 -18° 
Negro 4 4 
White 20 20 


Note.—Subjects of both groups resided in the same locality 
and all were males. 
a p is greater than .20. 


criteria of color, age, education, sex, and geographical 
residence so as to yield a group comparable to the 
experimental group for these personal variables. The 
data are presented in Table 1. 

The members of the control group were obtained 
from three separate sources in response to a solicitation 
to donate one hour of time to an unspecified research 
project. Eleven Ss volunteered at a labor union meeting, 
ten volunteered at a state employment office for & 
monetary incentive, and three were obtained while 
waiting for relatives at an outpatient clinic. 


Materials 


The materials for the completed and incompleted 
tasks consisted of 10 jigsaw puzzles, each depicting 4 
common animal with its name printed conspicuously 
along the border. The animals were kangaroo, giraffe, 
parrot, bear, rooster, eagle, swan, tiger, geese, an 
elephants, 

The Scott-Duke Questionnaire comprises the L and 
K scales of the MMPI, which are used as a check 0n 
validity, and a Self-acceptance scale. It is a self- 
administered, true-false type of questionnaire, score 
by means of an objective key. According to Scott (18), 
the Self-acceptance scale was based upon Sheerer’s 
definition of the self-accepting person, and it was Show? 
to have a high reliability and validity as a measure © 
self-acceptance. The split-half reliability was report 
by Scott as .94. Its validity was predicated upon it 
correlation of .77 (.83, corrected for attenuation) will 
Rogers’ Q-sort technique with 58 Ss. The empiric 
finding that it differentiated between a variety a 
Psychiatric disorders ranging from the psychoneuros© 
to schizophrenia, which yields the lowest score, a8 WË 
as its relation to the Q-sort measure of self-acceptance 
tends to justify its use in this experiment as an estima 
of ego strength, 


Experimental Procedure 


All Ss in both the experimental and control 8 
underwent an identical procedure, but only 17 S$ 
each group, matched for the personal variables, rec ai 
the Scott-Duke Questionnaire. The procedure © is 
sisted of the following sequence of individually ad 
tered tests: a vocabulary test, assembly of J8° 
Puzzles, digit span test, test for recall of jigsaw Fary 
zles, and the Scott-Duke Questionnaire. The voca Pr jate 
and digit span tests were introduced in order to simi nd 
more plausibly an intelligence testing condition, 
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the instructions given to both groups were worded 
appropriately. The ten jigsaw puzzles, constituting 
the memory material, were administered in a random 
Sequence. The order of completions (C) and incomple- 
tions (I) was identical for each S in the sequence 
ICCCIICIIC. The Ss, timed for their performance 
on each puzzle, were allowed to complete every one 
designated as a C puzzle, and were prevented from 
completing every I puzzle by the imposing of an ar- 
bitrary time limit just before the last two pieces were 
fitted into the partially assembled puzzle. After com- 
pleting each C puzzle, the S was asked to tell what the 
picture was if he did not do so spontancously. In order 
to ensure equal exposure time to all puzzles, the I 
Puzzles, which were finally completed by the experi- 
menter, were exposed to the S’s view, and he was 
asked to state what the picture was. Immediately after 
the interpolated digit span test, each S was asked to 
recall the names of all the jig-saw puzzles which he 
could remember. 


RESULTS 

The chi-square test was used to answer the 
question whether any distinctive pattern of 
recall was associated with the schizophrenics 
as compared with the control Ss. The Ss of 
cach group were divided into three categories 
according to their recall patterns: (a) the recall 
of a preponderance of I puzzles (I), (b) recall of 
a preponderance of C puzzles (C), and (c) re- 
call of an equal number of I and C puzzles 
(Equal). The first two patterns represent dif- 
ferent types of selective recall, while the third 
Pattern shows no selectivity. The distribution 
of Ss falling into each category is shown in 
Table 2. Chi square for the association of recall 
Pattern with psychiatric status (normal or 
schizophrenic) was 6.89 (.05 > p> .02). It 
may be observed that while a majority of the 
schizophrenic group demonstrated the I pat- 
tern, a minority of the control group also did 
so. The majority of the control group was dis- 
tributed evenly between the Equal and C 
Patterns. 

In order to determine the relationship be- 
tween scores on the Self-acceptance scale and 
the three recall patterns, all Ss were sorted 
according to their recall patterns and an analy- 
sis of variance of their Self-acceptance scores 
Performed, The obtained F of .32, as shown in 
Table 3, is not significant. 

Since Self-acceptance scores were unrelated 
to the recall patterns, evaluations were made 
Of the reliability and applicability of the scale 
3 the assessment of ego strength with the Ss 
$ this experiment. Internal consistency was 

timated by the split-half method, using the 
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TABLE 2 


FREQUENCIES OF SUBJECTS DEMONSTRATING 
Each Type or RECALL PATTERN 


I Equal C 

Schizophrenics 15 4 5 

Controls ó 9 9 
TABLE 3 


SUMMARY OF ANALYSIS OF VARIANCE OF 
SELF-ACCEPTANCE SCORES 


Sı í Sı í Mean 

SS Jaj smes | SAS SE 
Between groups | 2 31.68 | 15.84 | .32 | >.05 
Within groups | 31 | 1544.08 49.81 
Total 33 | 1575.76 


Spearman-Brown formula. A coefficient of .90 
was obtained for the 34 Ss in the combined 
groups. This coefficient, similar to that of .94 
reported by Scott, is high enough to suggest 
that the scale has high internal consistency. 
While a more meaningful evaluation of reli- 
ability would involve the question of the 
stability of the measurements over time, test- 
retest data were not available to permit this 
type of evaluation. With regard to the appli- 
cability of the Self-acceptance scale in estimat- 
ing ego strength, on a priori grounds it should 
differentiate between hospitalized schizophren- 
ics and normal persons. The mean Self-accept- 
ance score of the control group was 30.65, and 
of the schizophrenic group 25.12. A t of 2.96 
was significant at the .01 level, but since there 
was considerable overlap between the two 
distributions, its effectiveness in discrimination 
was rather low. 


DISCUSSION 


While the hypothesis that schizophrenics 
selectively recall experiences connoting per- 
sonal failure and diminished self-esteem was 
supported by the experimental findings, the 
hypothesis of the relation of this memory 
process to deficient ego strength was unsup- 
ported. The failure to confirm the second hy- 
pothesis may be attributable, in part, to two 
possible sources of error: one having to do with 
deficiencies in the instrumentation employed, 
the other involving certain weaknesses of the 
concept of ego strength. 

The estimation of ego strength by means of 
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the Self-acceptance scale seemed tenable in 
view of its demonstrated relationships to psy- 
chopathology and to an independent, ob jective 
measure of self-acceptance. However, its 
power of discrimination proved to be quite low 
owing to the degree of overlap of scores in the 
schizophrenic and normal groups. Further- 
more, while high split-half reliability was 
demonstrated, the question of the stability of 
measurement over time was not evaluated ow- 
ing to the unavailability of test-retest data. 
This would seem to be the more important 
consideration in establishing the level of reli- 
ability because of the probable susceptibility 
of the schizophrenics to transient influences 
and momentary moods which would make their 
responses to the Self-acceptance scale unde- 
pendable. In view of these sources of experi- 
mental error, one may question the effective- 
ness of the Self-acceptance scale as an estimate 
of ego strength. 
Another factor which might be responsible 
for the finding of no relationship, in spite of 


the positive findings of other experimenters 
who used different instruments and normal Ss, 


is that due to ambiguous operational and con- 
ceptual definitions of ego strength, different 
facets of this trait are measured by the various 
instruments employed. The findings of a recent 
study comparing several independent measures 
of ego strength have drawn attention to this 
ambiguity on empirical grounds (21). Because 
ego strength, defined by Eriksen as “the indi- 
vidual’s capacity for appraising the reasonable 
limits in his interpretations and perceptions of 
his environment” (10, p. 46) and measured by 
Rorschach F+ %, has been shown to correlate 
significantly with selective recall, this is no 
indication that ego strength measured by the 
Self-acceptance scale should. This considera- 
tion argues the need for a more precise defini- 
tion of the presently vague concept of “ego 
strength.” Certainly, the negative finding ob- 
tained in this experiment points up the need 
for continued research on the problems of ego 
strength and its role in schizophrenia. 


SUMMARY 


Two hypotheses concerning certain aspects 
of schizophrenic symptomatology were tested: 
(a) Schizophrenics manifest a distinctive type 
of memory process which results in the selective 
recall of experiences connoting personal failure 
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and diminished self-esteem and (b) this process 
is related to deficient ego strength. The method 
used to test the first hypothesis was to admin- 
ister jigsaw puzzles as a simulated intelligence 
test, allowing each S to complete only half of 
the group of puzzles, and to determine whether 
there occurred selective recall by the schizo- 
phrenics for the incompleted puzzles connoting 
personal failure. As a test for the second hy- 
pothesis the Scott-Duke Questionnaire, which 
contained a Self-acceptance scale and two 
validity scales from the MMPI, was admin- 
istered to provide a measure of ego strength, 
and its power of discrimination between 5S 
grouped according to their recall pattern was 
ascertained. The Ss consisted of 24 schizo- 
phrenic patients who were recently admitted 
or readmitted for hospitalization and a control 
group of 24 normal adults matched for age, 
education, sex, color, and geographical resi- 
dence. While the first hypothesis was confirmed 
at a significant level of confidence, the second 
was not supported by the experimental data. 
The failure to confirm the second hypothesis 
seemed to be attributable, in part, to two 
sources of error; one arising from deficiencies 
in the instrumentation, and the other from 
certain weaknesses of the concept of ego 
strength. It is conceivable that due to the 
vague operational and conceptual definitions 
of ego strength, different facets of this trait 
may be measured by the various instruments 
which have been employed. It is believed that 
the negative finding concerning the second 
hypothesis points up the need for continue 


research on the problem of ego strength and its 
role in schizophrenia. 
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EFFECTS OF PERCEIVED DEPENDENCY RELATIONSHIPS 
UPON CONFORMITY TO GROUP EXPECTATIONS! 


LEONARD BERKOWITZ 
University of Wisconsin 


WO PREVIOUS papers have shown that 

(a) groups evaluated as a unit become 

more highly motivated than groups 
whose members are evaluated separately (1), 
and (6) the individually evaluated groups that 
develop the highest task motivation are those 
whose members initially are highly motivated 
and then evaluated unfavorably (2). In both 
cases it was suggested that the final high task 
motivation resulted from a shared perception of 
group member interdependence for reward 
attainment. The present experiment provides 
a direct test of this hypothesis.2 In addition, an 
attempt will be made to demonstrate that the 
hypothesized relationship is a special case of a 


1 This study was supported by funds granted by 
the University Research Committee of the University 
of Wisconsin. The author would like to acknowledge 
Mr. Sidney Willmuth’s invaluable assistance in the 
collection of the data. 

? There is some evidence providing tentative support 
for the hypothesis. Deutsch (3) has shown, for example, 
that cooperative groups are more highly task motivated 
than competitive groups. The individuals in the co- 
operative groups are described as being “promotively 
interdependent” in that any member’s instrumental 
behavior could advance all members toward the goal, 
Deutsch’s results would appear to indicate, then, that 
these interdependent Ss were more highly motivated 
toward the assigned group task than the less promo- 
tively interdependent members of the competitive 
groups. However, the obtained differences may not be 
due to the degree of perceived promotive interdepend- 
ence per se. There is a possibility that the high group- 
task motivation was produced by one or more of the 
by-products of the cooperative relationship, such as 
the high group cohesiveness, either directly or in 
interaction with the perceptions of member interde- 
pendence. 

A somewhat similar difñculty is presented by a 
second related study. Pepitone (8) found that Ss who 
had been told their jobs were important to the group 
were more highly task motivated than Ss who had 
been told their jobs were of relatively little importance, 
Since group members generally are dependent upon 
the behavior of those with important jobs, the Ss’ task 

motivation apparently was directly affected by their 
belief that others in the group were dependent upon 
them. However, there also are alternative Possibilities, 
It is conceivable that ego-enhancing motives were 
aroused by £’s description of S’s job as being important 
to the group, and that these motives affected Pepitone’s 
results. 


more general relationship between perceived 
member interdependence and motivation to 
conform to relevant group expectations. 

Several components of perceived inter- 
dependence must be considered in accounting 
for its effect upon group-task motivation. The 
writer has proposed elsewhere (2) that the 
degree to which an individual feels dependent 
upon others is a joint function of two factors! 
the judged probability that his progress toward 
his goal is dependent upon the activity of these 
others (perceived instrumentality) and the 
extent to which he values this goal} In all 
likelihood, perceived instrumentality and goal 
value affect the degree of perceived dependence 
regardless of the particular direction of the 
dependency relationship. Thus, assuming that 
each group member's task performance 1$ 
necessary for any single member's goal attam- 
ment, a subject (S) should see himself as being 
highly dependent upon his teammates when he 
is likely to gaina goal he values highly through 
group activity with them. He also shoul 
perceive his teammates as being highly de- 
pendent upon him when the group activity 
gives them access to a goal they would val 
highly, Any given member of an interdependent 
group probably has both of these perceptions; 
namely, that he is dependent upon the other 
for the attainment of his goals, and also tha 
they are dependent upon him for their 80% 
attainment. 

The primary motive for an individual wag 
can attain his task-associated goal indepen” 
ently of the others in the group (i.e, an © 
who perceives low instrumental interrelatio” 
ships among the group members) stems sole 
from the value of this goal to him. The mem v- 
of an instrumentally interrelated group, ie 
ever, has additional motives created by t 

° Helen Peak has made a similar analysis of = 


tudes (7). She has suggested that the position “at! 
Person takes on an attitude scale is some functio”. et 
(a) “The judged probability that the [attitude] objè 
leads to good or bad consequences [i.e., the perce? the 
instrumental relationship between the object 2” fect 
Perceiver’s goal], and (b) the intensity of the © 
expected from those consequences.” P. 154. 
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knowledge that others are dependent upon 
his task behavior for their rewards. 

This knowledge may affect the person’s task 
motivation in at least two different ways. 
First, if the others are in a position to become 
aware of his performance, he may feel that 
effective task behavior on his part is necessary 
to win their good will. Although the desire to 
Win the approval of others probably is an 
effective motivating force in itself, this desire 
should be particularly important when the 
Person is dependent upon the others’ task 
activity for his access to a valued goal, as is the 
Case in interdependent groups. Second, his 
effective performance may be necessary in 
order to conform to hisinteriorized standards of 
Proper social conduct. Since the group mem- 
bers in the experiment to be described observed 
each other’s behavior, it is not possible to test 
these alternatives unequivocally. However, 
whatever the reason, under the present condi- 
tions the task motivation of Ss should be 
affected by the perception that others in their 
group are dependent upon them. 

Several specific hypotheses can be formu- 
lated on the basis of the foregoing discussion. 
The first has to do with the perceived instru- 
Mental interrelationships among the group 
members. 1. Members of groups who are in- 
strumentally interrelated for reward attainment 
become more highly motivated toward the group 
task than Ss who can achieve their goals in- 
dependently of the others in the group. The 
remaining hypotheses are based upon the 
assumption that the value of the goal to S and/ 
or his teammates also affects the perceived 
dependency relationships. In highly inter- 
dependent groups the task-associated goal is 
likely to be valued highly by all group members. 
2. Under conditions of instrumental interde- 
pendence, Ss who are told that both they and 
their teammates can attain a valued goal through 
the group activity become more highly task 
motivated than Ss who are told that either only S 
or only S’s partner can gain this goal. At least 
Part of S’s task motivation in an interde- 
Pendent group stems from his knowledge that 
Others are dependent upon him for their 
attainment of a goal they would value highly. 
3. Given that the instrumentally interrelated 
group members are aware of each other's behavior, 
Ss instructed that only their partners have access 
to a valued goal through the group activity become 
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more highly task motivated than Ss working for a 
goal that is less valued by both themselves and 
their partners. 


METHOD 


Group Task and Measure of Task Motivation 


Two Ss, both male or female, were scheduled to- 
gether, ostensibly to work on a task assessing their 
“effective intelligence.” They were told that the object 
of the task was for each of them to transmit a list of 
nonsense syllables to the other as quickly as possible 
by means of a simulated telegraph system. Each S 
operated two telegraph keys controlling light signals 
on his partner’s desk, “dot” for one light, and “dash” 
for the other. Employing a telegraphic code supplied to 
each S, S-I sent the first nonsense syllable on his list 
to his partner, S-JZ, who decoded and copied the 
message. S-Z then sent the first syllable on his list to 
S-I, who in turn decoded and wrote out the partner’s 
message before sending his second syllable to S-II. 
This alternation was continued for 15 syllables by each 
S. The group members obviously were instrumentally 
interrelated because of the structure of the task. 

After completing a given nonsense syllable, each S 
flashed a third light on his partner’s desk. A trial is 
defined as the time in seconds taken by a given S to 
receive and decode his partner’s message and then 
send his own message. The 15 trials were divided into 
three blocks of five trials each for computational 
convenience. The Æ told the Ss that the first five trials 
were “for practice” and would not count toward their 
score. The total time for these trials in the first block 
provided for each S a baseline activity rate from which 
the time scores for the two remaining blocks were 
subtracted to yield two improvement scores. The latter 
scores, the dependent variable in this study, serve as 
an index of S’s motivation to work rapidly in the given 
block, i.e., his motivation to achieve the designated 
task objective. Analyses of the data disclose no signifi- 
cant relationship between an S’s improvement scores 
and those of his partner. 


Research Design and Experimental Manip- 
ulations 


Under the pretext that he was being interviewed 
for life history data, each S was individually given the 
task instructions appropriate to the condition to which 
he was assigned. Five conditions were established as 
follows: 

(a) Both S and his partner eligible for a prize, high 
instrumental interrelationship. 

(b) S alone eligible for a prize, high instrumental 
interrelationship. 

(c) S’s partner alone eligible for a prize, high instru- 
mental interrelationship. 

(d) No prizes to be attained, high instrumental inter- 
relationship. ; y ; 

(e) Both eligible, low instrumental interrelationship. 

In each group, S-I (randomly chosen) was informed 
that ten-dollar prizes were to be awarded to the best 
Ss. Accuracy of transmission was listed as one of the 


criteria, but speed was emphasized. For Condition 1, 
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S-I was told both he and his partner were eligible for 
these prizes; the prizes were to be awarded to the best 
teams. In the second and third conditions, S-I was 
informed that Æ was interested in studying the behavior 
of only one of the two in the group, and either that 
he, S-I, was the only one being studied and hence 
eligible (Condition 2) or that his partner was being 
studied and only he was eligible (Condition 3). S-II 
was never told about the prizes and was instructed 
only to “try to complete the task quickly.” Condition 
4 is comprised of these latter Ss. : o 

The S’s eligibility for the prize in these conditions 
of high instrumental interrelationships is assumed to 
result in a perceived high degree of dependency upon 
his partner, while the partner’s eligibility presumably 
causes S to view his partner as being highly dependent 
upon him. The first four conditions therefore can be 
considered as representing a 2 X 2 factorial design, 
varying the relations of presumed perceived dependence 
between S and his partner for the prize reward. 

In the fifth condition, S-I was told that he and his 
partner would be judged independently and that each 
was eligible for separate prizes. These instructions were 
designed to lower the perceived instrumental interrela- 
tionships between the group members by counteracting 
the manifest task structure. A questionnaire, consisting 
of four seven-point rating scales, was administered at 
the end of the experimental session to test the success 
of this manipulation in producing the belief that S's 


TABLE 1 
PERCEIVED INSTRUMENTAL INTERDEPENDENCY@ 


Condition 

Instru- 

EEEN 
Eligible for Prize pendency Mean SD 
1. Both High 18.89 6.30 
2. S alone High 20.56 5.23 
3. S's partner alone High 19.89 4.88 
4. No prize High 20.33 5.66 
5. Both Low 14.00 6.06 


a N = 9 in each condition. Ther 
ences among condition means. 


TABLE 2 
MEAN Time Improvement Scores 1N CONDITIONS 
VARYING IN DIRECTION AND DEGREE oF Dg- 
PENDENCY BECAUSE OF DIFFERENCES 
IN Prize ÅTTAINABILITY 


. Prize Not Attainable 
Prize Attainable By S By S 


Prize Prize Not Prize Prize Not 
Attainable Attainable Attainable Attainable 
By S's By S's By S’s By S's 
Partner Partner Partner 


(Condition (Condition (Condition (Condition 
1) 2) 3) 4) 


Partner 


Trials 
9 58.33 57.55 59.00 21.67 
Block 3 73.44 55.67 45.78 28.33 
All trials 65.89 56.61 52.39 25.00 
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attainment of the reward was not dependent upon the 
activity of his partner. A sample item was: “How 
dependent were you upon your partner in completing 
your task as quickly as possible?” As is indicated m 
Table 1 below, the items refer to the perceived instru- 
mental interrelationships between the team members, 
and not to the perceived degree of dependence resulting 
from the interaction of this instrumental interrelation- 
ship with goal value, Since the four items were 
significantly and Positively intercorrelated, S’s per- 
ceived instrumentality score was determined by 
summing his four ratings. 


All Ss were volunteers from liberal arts courses at the 
University of Wisconsin. There were nine Ss in each of 
the conditions, Three S-Is were chosen at random from 
each of the first three conditions and the S-//s paired 
with them were used for the Condition 4 data. Approx- 


imately half of the Ss were females distributed evenly 
among the conditions, 


RESULTS 


Dependency Differences Created by Differences 
in Goal Value 


The Ss in the first four conditions were led to 
believe that they and their partners were in- 
strumentally interrelated in completing the 
assigned task, As is shown in Table 1, Condi- 
tions 1 through 4 had essentially similar means 
on the postsession questionnaire assessing the 
Perceived instrumental relations between the 
team members, With perceived instrumentality 
constant, then, the direction and degree of 
dependency between team members in these 
conditions Presumably varied directly with 
eligibility for the Prize. Analyses of variance 
of the two sets of improvement scores iD 
Conditions 1 through 4 revealed significant 
Main effects for both S Eligible for Prise 
(E = 15.85 @ = OM}, ahd S's Purina Eligible 

į 10.14; p < .01). The first main 
© the success of the goal value 
- The more S stands to gain bY 
task rapidly, the faster he 
nd effect suggests that 5’S 
vation increases when he 
her group members are highly 
him for the attainment © 


8Toup-task moti 
Perceives that ot 
dependent upon 
their valued goals. 


ere were no significant differences in time 
Scores for the baseline Period among Conditions 
1 through 4, so Table 2 presents only the mea? 
™provement scores for the second and thir 
trial blocks. With the exception of the differ- 
ences between Condition 4 and the remaining 
Conditions in both trial blocks, the effects °; 
the perceived dependency relationships did 2° 
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become significant until Block 3. According 
to Hypothesis 2, Ss in Condition 1, who were 
told that both they and their partners were 
eligible for the valued rewards and, therefore, 
who may well have perceived a relatively high 
degree of interdependency, should be more 
highly motivated than the Ss in the other 
instrumentally interrelated conditions. This 
hypothesis was confirmed at an acceptable level 
of confidence (p < .05 by / tests) for each of the 
three comparisons with Condition 1 in the 
Block 3 data. 

The only other significant difference in 
Block 3 was that between Conditions 3 and 4 
(p = .05). In support of Hypothesis 3, Ss who 
were informed that the prize was attainable by 
their partners became highly motivated to 
perform their task even though they themselves 
Were not eligible for the reward. 


Effects of Perceived Instrumental Inter- 


dependency 


Both S-Z and his partner were eligible for the 
prize in Conditions 1 and 5, but the latter 
condition, it will be recalled, was designed to 
have low perceived instrumental interrelation- 
ships between the team members. Analysis of 
the perceived instrumentality scores indicated 
that the low instrumental dependency ma- 
nipulation was not completely successful; as is 
shown in Table 1, the Condition 5 mean was 
not significantly lower than the Condition 1 
mean (p = -15). A preliminary assessment of 
the motivational effects of this perceived in- 
strumental interrelationship was obtained by 
discarding the two Ss with the lowest perceived 
instrumentality scores from Condition 1 and 
the two Ss with the highest scores from Condi- 
tion 5 so that the conditions did differ sig- 
nificantly on this variable by / test (p < .05). 
The Block 3 difference in task motivation 
scores became significant with the elimination 
of these four Ss (p < .05), providing tentative 
support for Hypothesis 1. The improvement 
score means in Conditions 1 and 5 were now 
81.00 and 49.57 respectively. Thus, Ss who 
believe they and their partners are instru- 
mentally dependent upon each other for the 
attainment of a valued goal are more highly 
motivated than Ss who believe they and their 
partners can attain this goal independently of 
€ach other. 
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Discussion 


The obtained results generally support the 
hypotheses of the study. Perceived group mem- 
ber interdependency for reward attainment can 
increase an individual’s motivation toward the 
assigned group task. The findings also suggest 
that this heightened motivation can be a 
cumulative effect reflecting the self-perception 
that he is dependent upon his partner and also 
the perception that the partner is dependent 
upon him. 

If we assume that an individual who sees 
that the others in his group are dependent upon 
him and who also perceives that these others 
demand or expect certain task-relevant be- 
haviors-of him (in the present case, rapid 
work), the following general proposition can be 
formulated: the strength of the motivation to 
conform to the task-relevant expectations of 
others in the group is a positive function of the 
perceived degree of inlerdependence among the 
members. Intragroup opinion conformity under 
certain conditions, as well as many instances of 
behavioral conformity to group norms, can 
then be seen as special cases of this general 
proposition. For example, it is clear that pres- 
sures toward uniformity of opinion are 
heightened when “group locomotion” requires 
unanimity (6). To illustrate, Deutsch and 
Gerard (4) recently created this type of require- 
ment in some of their experimental conditions 
by offering prizes to the best groups. The Ss 
given this instruction probably realized that 
any group member’s chances of obtaining the 
desired goal was dependent upon the activity 
(i.e., expressed opinions) of the others in the 
group. Different instructions produced low 
perceived interdependency in other groups. As 
a result, the Ss in the highly interdependent 
groups exhibited a greater tendency to conform 
to the opinions of others in the group than the 
Ss in less interdependent groups even though 
it was fairly obvious that the group opinion 
was wrong and they could express their own 
opinions anonymously. 

Thus, perceived high interdependence ap- 
parently produces conformity to group opinions 
as well as high group task motivation, support- 
ing the hypothesis of a general relationship 
between perceived task interdependency and 
conformity to the task expectations of the 
others in the group. Attitudinal variables that 
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affect the degree to which interdependent 
members adopt each other’s opinions, then, 
may generally affect behavioral conformity to 
relevant shared expectations in interdependent 
groups. Dittes and Kelley (5), employing 
interdependent groups faced with the require- 
ment of consensus, found that the members’ 
adoption of the group opinion was a function 
of the extent to which they felt accepted by the 
group. If the present general proposition is 
correct, differences in acceptance by the group 
also should produce differences in the relation- 
ship between perceived interdependence and 
group task motivation. 


SUMMARY 


The present study provides a direct test of 
an hypothesis invoked to account for the 
results of two preceding experiments, viz.: 
group members perceiving themselves as 
interdependent for their reward attainment 
become highly motivated toward the group 
task. An analysis also is made of the com- 
ponent relationships involved in perceived 
interdependence between members of a two- 
person group. The results generally are in 
accord with the following predictions: 1. Mem- 
bers of groups who are instrumentally inter- 
dependent become more highly motivated 
toward the group task than Ss who can achieve 
their goals independently of the others in the 
group. 2. Under conditions of instrumental 
interdependence, Ss who are told that both 
they and their teammates can attain a valued 
goal through the group activity become more 
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highly task motivated than Ss who are told 
either that only S or only S’s partner can attain 
this goal. 3. Given that the group members are 
instrumentally interdependent and also in a 
position to see each other’s behavior, Ss told 
that only their partners have access toa valued 
goal become more highly motivated toward 
the task than Ss working for a goal that is less 


highly valued by both themselves and their 
partners. 
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THE EFFECT OF TRAINING UPON ACCURACY AND VARIABILITY 
IN INTERPERSONAL PERCEPTION! 


W. J. CROW? 
Behavior Research Laboratory, University of Colorado 


NE purpose of many training programs 
in clinical, counseling, and personnel 
psychology is to increase the ac- 

curacy of the trainee’s perceptions of others. 
The ability to “understand others,” or to 
have “clinical intuition” is considered a 
basic requirement for the good practitioner. 
It has been difficult to evaluate the success of 
such training, however, because until recently 
there has been no way to measure, by ob- 
jective tests, the subtle and complex psy- 
chological variables that are presumably 
detected by the practitioner. Therefore, 
determination of improvement in clinical 
judgment has itself depended upon other 
clinical judgments. That is, either the accuracy 
of the students’ perceptions would be directly 
rated by observers, or the students’ judg- 
ments would be compared with the judgments 
of experienced practitioners. 

Recent methodological improvements in the 
measurement of interpersonal perceptiveness 
have made it possible to measure the accuracy 
of clinical judgment in a more objective way- 
It is now customary to ask a person (the ob- 
ject) to fill out a personality questionnaire 
and then ask the trainee (the subject) to fill 
out the identical questionnaire just as he 
believed the object has filled it out. The 
measure of the subject’s accuracy is the degree 
of correspondence between the object’s actual 
responses and those which the subject esti- 
mated that the object had made. Such a 
technique has been used by Kelly and Fiske 
(7), for example, tor measure the diagnostic 
competence of clinical trainees. f 

Early studies using this technique for 
measuring interpersonal perception focused 
primarily upon accuracy, but attention has 
been called recently to the importance of 
other components of the interpersonal-per- 


1 This investigation was carried out as part of a larger 
study of medical education supported by the Common- 
Wealth Fund of New York. The author is indebted to 

r. Kenneth R. Hammond for assistance and advice 
and to Dr. Fred Kern, Jr., for his assistance during 
many stages of the study. 

? Now at San Diego College. 


ception situation (1, 2, 5). Reasoning mathe- 
matically from an analysis-of-variance model, 
Cronbach (1) has advanced a hypothesis for 
the relationship between accuracy and one of 
these components—the amount of variability 
in the subjects’ estimations: 

There is an opåmal degree of differentiation in mak- 
ing judgments. If a judge [subject] can make accurate ` 
judgments as to the relative location of others [objects] 
on a continuum, then he is wise to make øy (the varia- 
bility in his estimations) as large as ø+ (the variability 
in the others’ scores)—never larger. But if he is forced 
to base his judgment on inadequate cues or if the avail- 
able personality theory and situational knowledge do 
not permit trustworthy inference, then he should treat 
people as if they were very nearly alike. The person who 
attempts to differentiate individuals on inadequate data 
introduces error even when the inferences have validity 
greater than chance (1, p. 181). 


It follows from Cronbach’s hypothesis that, if 
the subject increases the variability of his 
judgments from one time to another and his 
ability to make accurate judgments does not 
increase correspondingly, then he will make 
greater errors the`second time than the first. 
It also follows that whenever a judgmental 
task is very difficult or the subjects are very 
low in ability, subjects with little variability in 
their estimations will make small errors, and 
subjects with high variability will make large 
errors. Stated otherwise, accuracy and varia- 
bility will be negatively correlated when the 
task is difficult or the subject is very inac- 
curate. 

The purpose of the study herein reported 
was twofold: (a) to investigate the effects of a 
training program upon interpersonal per- 
ception, and (b) to test the hypothesis of the 
relationship between accuracy and variability 
in estimations. 

To accomplish these purposes, interpersonal 
perception measures were administered at the 
beginning, during, and after the Senior year to 
a class of medical students who were divided 
into experimental and control groups. The 
experimental group was instructed in phy- 
sician-patient relationships and was provided 
with the opportunity to establish such re- 
lationships through more prolonged contact 
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with patients, than was possible for the con- 
trol group. It was expected (a) that the 
students in the experimental group would 
become more accurate in interpersonal per- 
ception than students in the control group, and 
(b) that variability in estimation would be 
negatively related to accuracy of judgment. 


METHOD 
Subjects and Treatments 


The Ss were 72 Senior medical students who were 
assigned at random to experimental and control groups. 
Approximately one fourth of the students were assigned 
to the experimental clinic during the first half of the 
year, and the remaining three-fourths were assigned to 
one of two surgery clinics or to an internal-medicine 
clinic. During the second half of the year, one of the 
first-half surgery groups was assigned to the experimen- 
tal clinic, the other was assigned to the internal- 
medicine clinic, and the first-half experimental and 
internal-medicine groups were assigned to the surgery 
clinics. All students were tested at the beginning (Pre- 
test), at the midpoint (Posttest I), and at the end 
(Posttest II) of their Senior year. 


Measures 


Thirty-six patients were selected at random? (with 
certain restrictions) from the medical wards of two 
hospitals, and each was interviewed by a physician. 
Six-minute sound films were taken of each of these 
interviews, and three randomly-selected sets of ten 
films were shown to the Ss, one of them at each of the 
three test administrations, 

The Ss were required to perform two tasks: Task A 
was to estimate the self-ratings of the patients on seven 
personality scales, which were paraphrases of selected 
Minnesota Multiphasic Inventory scales (6); Task B 
was to estimate where the patient “really was” on the 
same set of scales. The patient’s self-ratings were the 
criteria of accuracy on Task A; the patients’ perform- 
ance on the MMPI for Task B.* 


Scoring 


The Random-Comparison Method (2) was used to 
compute the scores, since this method yields a Differ- 
ential-Accuracy score by eliminating artifactual com- 
ponents from the raw scores. In addition to the Accu- 
racy scores, two response-set measures, which had 
previously been found to be stable aspects of the stu- 
dents’ performance (4), were investigated in the present 
study: 1. A measure of adherence to stereotype, defined 
as the inverse of the Average-Variance score of an §’s 
estimations for the scales. (An S may perceive wide 
differences among people in the characteristic under 
consideration, or he may perceive people as being very 


3 For a discussion of importance of the manner of the 
selection of the objects in interpersonal perception 
studies see (3). 

4 For a detailed description of the measures and the 
scoring techniques see (2). 
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much alike. He may, in other words, cluster his estima- 
tions closely or he may vary them widely. If an S 
clusters his estimations closely, his Average-Variance 
score will be small and his adherence to stereotype will 

be high. The variance of the estimations for each scale 
was computed and averaged for all scales as a measure 

of perceived individual differences.) 2. A measure of | 
assumed veridicality, defined as the inverse of the Av- 
erage Difference between the S’s estimation for the 
object’s self-rating and the S’s estimation for where the 
object “really was.” (An S could assume a large meas- 
ure of agreement between an object’s self-rating and 
“reality,” or he could assume little agreement. If an Ss 
assumed a large measure of agreement, the Average- 
Difference score between his estimation for the object’s 
self-rating and his estimation for where the object 
“really was” would be small, and the assumed veridi- 
cality would be high.) 

The Ss’ accuracy, adherence to stereotype, and as- 
sumed veridicality were measured in this fashion for 
each of the three administrations. Change scores were 
computed from pre- to posttest in order to compare the A 
experimental and control groups and to permit investi- T 
gation of the relationship between accuracy and the 
two response-set characteristics. 


7 k 
Effect of Training upon Interpersonal Percephton f 


Table 1 presents the means and standard 
deviations of the change in the Ss’ performance 
between the Pretest and Posttest I (first-hal 
change) and between the Posttest I an 
Posttest II (second-half change) for the 
Accuracy, Average-Variance, and Average- 
Difference scores. The significance of the 
difference between the mean change for the 
experimental and control groups was com- 
puted by use of the ¢ test for small samples- 

Examination of the accuracy results Te- 
vealed that the control-group improvement 
was significantly greater than the expel 
mental-group improvement for one of the 
four measures (Task B, first half). The control- 
group improvement was also greater on two ° ' 
the three remaining Accuracy scores, but the 
difference was not statistically significant. 
Contrary to what had been expected, the | 
experimental group, which received training ” 
interpersonal relations, did not improve more $} 
than the control group, which had not receive si 
such training. In fact, the trend of resu eg 
Suggests the opposite conclusion: that th i 
training in interpersonal relations decreas® 
accuracy. he 

Table 1 also presents the results for ia 
Average-Variance scores. For each of t eg 
four measures the experimental-group chang’ s 
was significantly different from the cont?” 
group change. The experimental group 
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creased the average variance of their esti- 
mations more than did the control group from 
pre- to posttest. They exhibited more dif- 
ferentiation than did the control group; that 
is, they manifested less adherence to stereo- 
type. 

The results for the Average-Difference 
Scores were similar. The experimental group 
changed more than did the control group in 
the amount of average difference that they saw 
between a patient’s self-rating and where the 
Patient “really was” on the personality scales. 
The results were statistically significant for 
both the first and second halves. Since the Ss 
in the experimental group increased their 
Average-Difierence scores, they assumed less 
veridicality than did the control-group stu- 
dents. 

These results indicate that the effect of the 
training program for the experimental group 
Was to increase the amount of differentiation 
that the Ss made in their judgments about 
patients, There also appears to be a loss of 
accuracy, but the results were not conclusive. 


Relationship between Accuracy and Variability 
in Estimations 

_ The results obtained for the effect of train- 

ing upon interpersonal perception were in 

accord with Cronbach’s hypothesis that if an 


TABLE 1 
MEAN INCREASE, STANDARD DEVIATIONS AND i FOR 
DIFFERENCES BETWEEN GROUPS FOR INTERPER- 
SONAL PERCEPTION SCORES 


Experimental Control 
ist Half 
N = 20 ist Half N = 
Measures ance and Half N = 
Mean Mean) 


In- SD | In- | SD t 
crease 
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TABLE 2 
CORRELATION COEFFICIENTS BETWEEN INCREASE IN 
Accuracy, INCREASE IN AVERAGE-VARIANCE 
Scores AND INCREASE IN AVERAGE-DIFFER- 
ENCE SCORES* 


Increase in Accuracy 
Task A Task B 
Ist | 2nd | ist 2nd 
Half |Half| Half | Half 
Increase in Average-Vari- 
ance scores 
Task A First half —.19 
. Second half 13 
Task B First half — .50* 
Second half —.27* 
Increase in Average-Differ- 
ence scores 
First half i AE .45* Ş 
Second half .07 —.26 
aN = 7. 


* Significant at .05 level. 


S increases his differentiation among objects 
without a corresponding increase in estimating 
ability, he will make greater errors. But the 
findings reported apply to the groups of Ss 
only. A more direct way to study the relation- 
ship of variability and accuracy is to correlate 
the scores of the individual Ss on the two 
measures; therefore, change in Accuracy 
scores was correlated with the change in 
in Average-Variance and Average-Difference 
scores for the first and second halves. 

Table 2 presents the correlation coefficients 
obtained between change in variability and 
change in accuracy. Four of the eight correla- 
tions (all for Task B) were negative and sig- 
nificant beyond the .05 level of confidence. 
These results also support Cronbach’s hy- 
pothesis. 


TABLE 3 
CORRELATION COEFFICIENTS BETWEEN ACCURACY AND 
AVERAGE-VARIANCE AND AVERAGE-DIFFERENCE 
Scores FOR THREE ADMINISTRATIONS 


Accuracy scores 
Task A First half 1.70 | 5.95 | 3.88) 5.92 |1.36 Accuracy 
Second half —3.06 | 6.74 |—3.89) 6.17 [0.46 
Task B First half —12.65 | 8.67 |—6.81| 2.78 |2.46* Task A Task B 
Second half 12.29 | 6.52 | 14.82| 7.95 |1.16 
Average-Variance scores Average-Variance 
Task A First half 0.36 | 2.99 |—1.51] 3.43 |2.08° Pretest ‘51 sh 
Second half 1.53 | 3.51 |—0.29) 2.50 |2.30* Posttest I -30 76 
Task B First half 1.09 | 3.29 |—1.57| 3.47 |2.85** Posttest II 37 55 
Second half 2.78 | 2.82 | 0.83] 2.57 |2.59*  Average-Difference 
Average-Difference scores Pretest 34 Sif 
First half 0.17 | 0.19 |—0.06) 0.26 |3.48** Posttest I 35 .56 
Second half 0.11 | 0.21 |—0.06) 0.21 |2.78°* Positest IT 33 49 
N= 72. 


*p= <05. 
"p= <0. 


r = .28 at the .01 level. 
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Table 3 presents the correlations for each 
administration between Accuracy scores and 
both Average-Variance and Average-Difference 
scores. All of the twelve correlation coefficients 
were negative and significant beyond the .01 
level of confidence. As the Ss made more 
variable estimations, their accuracy decreased. 
These results support quite conclusively the 
hypothesis that accuracy is negatively related 
to the amount of variability in Ss’ estimations. 


Discussion 


Contrary to expectation, the group which 
received more training in interpersonal rela- 
tions (the experimental group) did not improve 
in accuracy of interpersonal perception more 
than did the group without such special 
training (control group). In fact, the results 
support the opposite conclusion: that, as a 
result of their special training, the experi- 
mental group became less accurate than the 
control group. 

The experimental group increased the 
variability in their estimations significantly 
more than did the control group. The experi- 
mental group also assumed significantly less 
agreement between the way Ss would rate 
themselves and the way they “really were.” 
These results, in conjunction with the fact 
that the experimental group decreased in 
accuracy, support Cronbach’s hypothesis that, 
if an S increases his differentiation without a 
corresponding increase in ability, he will make 
greater errors. This hypothesis was further 
supported by the finding that individual 
Accuracy scores were negatively correlated 
with both Average-Variance and Average- 
Difference scores. Additional (though less 
conclusive) support for the hypothesis 
stemmed from the finding that change in 
variability from pre- to posttest was nega- 
tively correlated with change in accuracy. The 
more that Ss increased the amount of varia- 
bility they placed in their estimations, the 
less accurate they became. These results, 
taken together, constitute compelling evidence 
that the effect of the training program was to 

increase the variability that the trainees 
placed in their judgments and that their 
accuracy therefore decreased. 

Investigation of the relationship between 
variability and accuracy in interpersonal 
perception has not progressed to the point 
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where Cronbach’s hypothesis can be recom- 
mended as a basis for action. However, the 
implications of the hypothesis are so important 
that they deserve statement, particularly in 
regard to training programs such as the one 
reported in this study. These results indicate 
that training programs devoted to increasing 
accuracy of interpersonal perception (as in 
medical education and in the training of 
teachers, clinical psychologists, and similar 
specialists) run the risk of decreasing accuracy 
when they increase the trainees’ responsiveness 
to individual differences. Since very little is 
known about how to train people to make more 
accurate judgments about others, training 
programs frequently utilize a procedure of 
“exposure” and little else. The belief that 
placing the trainee in a position to observe 
others and to make judgments will produce 
desirable results is challenged by these findings- 
Such experience may lead the trainee to 
differentiate among people far beyond his 
capacity to do so accurately. In the absence 
of dependable measures of his accuracy, the 
trainee lacks knowledge of his errors and may 
continue inappropriate overdifferentiation long 
after the training has ceased. 

Cronbach has discussed this problem as 
follows: “Recognizing an optimum degree 0 
differentiation makes it necessary tO 
re-examine and qualify statements commonly 
made in training teachers, to the effect that 
every pupil has his own pattern and the teacher 
must fit methods to that pattern, not treat the 
pupil in terms of the statistical average 
(1, P. 183). Whatever the ethical, moral, oF 
ideological justification for emphasizing the 
unique characteristics of each person’s ind 
viduality, the evidence so far indicates that 
“differentiation is harmful if the extent ° 
adaptation or differentiation exceeds thé 
amount justified by the accuracy of soci# 
perception” (1, p. 183) 


SUMMARY 


The present study investigated the relation’ 
ship between accuracy and variability 
estimations and the effects of a training P” 


gram in interpersonal relations ne of 
interpersonal perception, Seventy-two Se? rie 
medical students were divided into an e*P® jp 
mental group who received training ol 


physician-patient relationships and a app: 
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group who did not receive such training. At 
the beginning, during, and at the end of their 
Senior year, the students estimated the real 
Status and self-ratings of patients presented 
in sound-film recorded interviews. Actual 
self-ratings and relevant MMPI scores were 
available for these patients as criteria. 
Contrary to expectation, the experimental 
group tended to become less accurate than the 
Control group. The experimental group in- 
creased the variability in their estimations 
Significantly more than did the control group 
and also assumed significantly less agreement 
between the way subjects would rate them- 
selves and the way they “really were.” Ac- 
curacy scores were negatively correlated with 
variability scores and change in variability of 
estimation was negatively correlated with 
change in accuracy. These results, which agree 
with a hypothesis advanced by Cronbach, sug- 
gest that training programs devoted to in- 
creasing accuracy of interpersonal perception 
Tun the risk of decreasing accuracy when they 


increase the  trainee’s 
individual differences. 


responsiveness to 
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REFERENCE GROUPS, MEMBERSHIP GROUPS, AND 
ATTITUDE CHANGE! 
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been recognized that an individual’s 

membership groups have an important 
influence on the values and attitudes he holds. 
More recently, attention has also been given to 
the influence of his reference groups: the groups 
in which he aspires to attain or maintain mem- 
bership. In a given area, membership groups 
and reference groups may or may not be 
identical. They are identical when the person 
aspires to maintain membership in the group of 
which he is a part; they are disparate when the 
group in which the individual aspires to attain 
membership is one in which he is not a member. 
It has been widely asserted that both member- 
ship and reference groups affect the attitudes 
held by the individual (4). 

The present study is an examination of the 
attitude changes which occur over time when 
reference groups and membership groups are 
identical and when they are disparate. The 
study takes advantage of a field experiment 
which occurred in the social context of the lives 
of the subjects, concerning events considered 
vital by them. The subjects were not aware 
that their membership and reference groups 
were of research interest; in fact, they did not 
know that the relevant information about these 
was available to the investigators. 

The field experiment permitted a test of the 
general hypothesis that both the amount and 
the direction of a person’s attitude change over 
time depends on the attitude norms of his 
membership group (whether or not that group 
is chosen by him) and on the attitude norms of 
his reference group. 


T SOCIAL psychological theory, it has long 
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This hypothesis is tested with subjects who 
shared a common reference group at the time of 
the initial assessment of attitudes. They were 
then randomly assigned to alternative mem- 
bership groups, some being assigned to the 
chosen group and others to a nonchosen group. 
Attitudes were reassessed after a year of ex- 
perience in these alternative membership 
groups with divergent attitude norms. During 
the course of the year, some subjects came to 
take the imposed (initially nonpreferred) 
membership group as their reference group. 
Attitude change after the year was examined 
in terms of the membership group and reference 
group identifications of the subjects at that 
time. 


THE FIELD EXPERIMENT 


The Ss of this study were women students at 
a large private coeducational university. The 
study was initiated shortly before the end of 
their freshman year, when they all lived in the 
same large freshman dormitory to which they 
had been assigned upon entering the university- 
At this university, all women move to new 
housing for their sophomore year. Several 
types of housing are available to them: a large 
dormitory, a medium-sized dormitory, several 
very small houses which share common dining 
facilities, and a number of former sorority 
houses which have been operated by the uni- 
versity since sororities were banished from the 
campus. These latter are located among the 
fraternity houses on F raternity Row, and are 
therefore known as “Row houses.” Although 
the Row houses are lower in physical comfort 
than most of the other residences for wome!» 
students consider them higher in social status: 
This observation was confirmed by a poll ° 
students (5, p. 205), in which over 90 per cent 
of the respondents stated that Row houses f 
women were higher in social status than 20” 
Row houses, the remaining few disclaiming 2” 
information concerning status differen’ 
among women’s residences. 
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In the Spring of each year, a “drawing” is 
held for housing for the subsequent year. All 
freshmen must participate in this drawing, 
and any other student who wishes to change 
her residence may participate. It is conducted 
by the office of the Dean of Women, in co- 
operation with woman student leaders. Any 
participant’s ballot is understood to be secret. 
The woman uses the ballot to rank the houses 
in the order of her preference. After submitting 
this ballot, she draws a number from the 
hopper. The rank of that number determines 
the likelihood that her preference will be 
satisfied. 

In research reported earlier (5), a random 
sample was drawn from the population of 
freshman women at this university, several 
tests were administered to the Ss in that 
sample, and (unknown to the Ss) their housing 
preferences for the forthcoming sophomore 
year were observed by the investigator. The Ss 
were characterized as “high status oriented” if 
they listed a Row house as their first choice, 
and were characterized as “low status oriented” 
if they listed a non-Row house as their first 
choice. The hypothesis under test, drawn from 
reference group theory and from theoretical 
formulations concerning authoritarianism, was 
that high status orientation is a correlate of 
authoritarianism. The hypothesis was con- 
firmed: freshman women who listed a Row 
house as their first choice for residence scored 
significantly higher on the average in author- 
itarianism, as measured by the E-F scale (1, 2) 
than did women who listed a non-Row house 
as their first choice. The present study is a 
continuation of the one described, and uses as 
its Ss only those members of the original 
sample who were “high status oriented,” i.e., 
preferred to live ina Row house for the soph- 
omore year. In the initial study (5), of the 95 
Ss whose housing choices were listed, 39 were 
“high status oriented,” i.e., demonstrated that 
the Row was their reference group by giving a 
Row house as their first choice in the drawing. 
Of this group, 28 were available to serve as Ss 
for the follow-up or “change” study which is 
the topic of the present paper. These women 
form a homogeneous subsample in that at the 
conclusion of their freshman year they shared 
a common membership group (the freshman 
dormitory) and a common reference group 
(the Row). These Ss, however, had divergent 


experiences during their sophomore year: nine 
were Row residents during that year (having 
drawn sufficiently small numbers in the housing 
drawing to enable them to be assigned to the 
group of their choice) and the other 19 lived 
in non-Row houses during that year (having 
drawn numbers too large to enable them to be 
assigned to the housing group of their choice). 

E-F scores were obtained from each of the 28 
Ss in the course of a large-scale testing program 
administered to most of the women students at 
the university. Anonymity was guaranteed to 
the Ss, but a coding procedure permitted the 
investigators to identify each respondent and 
thereby to isolate the Ss and compare each S’s 
second E-F score with her first. 

To prevent the Ss from knowing that they 
were participating in a follow-up study, several 
procedures were utilized: (a) many persons 
who had not served in the earlier study were 
included in the second sample, (b) the testing 
was introduced as being part of a nation-wide 
study to establish norms, (c) the test admin- 
istrators were different persons from those who 
had administered the initial tests, (¢) Ss who 
informed the test administrator that they had 
already taken the “Public Opinion Question- 
naire” (E-F scale) were casually told that this 
did not disqualify them from participating in 
the current study. 

The Ss had no hint that the research was in 
any way related to their housing arrangements. 
Testing was conducted in classrooms as well as 
in residences, and all procedures and instruc- 
tions were specifically designed to avoid any 
arousal of the salience of the housing groups 
in the frame of reference of the research. 

The annual housing drawing was conducted 
three weeks after the sophomore-year testing, 
and, as usual, each woman’s housing ballot was 
understood to be secret. In this drawing, each 
S had the opportunity to change her mem- 
bership group, although a residence move is 
not required at the end of the sophomore year 
as it is at the end of the freshman year. If an 
S participated in this drawing, the house which 
she listed as her first choice on the ballot was 
identified by the investigators as her reference 
group. If she did not, it was evident that the 
house in which she was currently a member was 
the one in which she chose to continue to live, 
i.e., was her reference group. With the informa- 
tion on each S’s residence choice at the end of 
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her freshman year, her assigned residence for 
her sophomore year, and her residence choice 
at the end of her sophomore year, it was pos- 
sible to classify the subjects in three categories: 


A. Women (x = 9) who had gained assignment to 
live on the Row during their sophomore year and 
who did not attempt to draw out of the Row at 
the end of that year; 

B. Women (n = 11) who had not gained assignment 
to a Row house for the sophomore year and who 
drew for a Row house again after living in a non- 
Row house during the sophomore year; and 

C. Women (w = 8)who had not gained assignment to 
a Row house for the sophomore year, and who 
chose to remain in a non-Row house after living in 
one during the sophomore year, 


For all three groups of Ss, as we have pointed 
out, membership group (freshman dormitory) 
and reference group (Row house) were common 
at the end of the freshman year. For Group A, 
membership and reference groups were identical 
throughout the sophomore year. For Group B, 
membership and reference groups were dis- 
parate throughout the sophomore year. For 
Group C, membership and reference groups 
were initially disparate during the sophomore 
year but became identical because of a change 
in reference groups. 


As will be demonstrated, the Row and the 
non-Row social groups differ in attitude norms, 
with Row residents being generally more au- 
thoritarian than non-Row residents. From 
social psychological theory concerning the 
influence of group norms on individuals’ at- 
titudes, it would be predicted that the different 
group identifications during the sophomore 
year of the three groups of Ss would result in 
differential attitude change. Those who gained 
admittance to a Row house for the sophomore 
year (Group A) would be expected to show the 
least change in authoritarianism, for they 
spent that year in a social context which re- 
inforced their initial attitudes. Group C Ss 
would be expected to show the greatest change 
in authoritarianism, a change associated not 

only with their membership in a group (the 
non-Row group) which is typically low in au- 
thoritarianism, but also with their shift in 
reference groups, from Row to non-Row, i.e., 
from a group normatively higher in author- 
itarianism to a group normatively lower. The 
extent of attitude change in the Ss in Group B 
would be expected to be intermediate, due to 
the conflicting influences of the imposed mem- 
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bership group (non-Row) and of the unchanged 
reference group (Row). The research hypoth- 
esis, then, is that between the time of the 
freshman-year testing and the sophomore-year 
testing, the extentof change inauthoritarianism 
will be least in Group A, greater in Group B, 
and greatest in Group C. That is, in extent of 
attitude change, Group A < Group B 
< Group C. 
RESULTS 

Group norms. From the data collected in the 
large-scale testing program, it was possible to 
determine the group norms for authoritarian 
attitudes among the Row and the non-Row 
women at the university. The E-F scale was 
administered to all available Row residents 
(x = 303) and toa random sample of residents 
of non-Row houses (n = 101). These Ss were 
sophomores, juniors, and seniors. The mean 
E-F score of the Row women was 90, while the 
mean E-F score of the non-Row was 81. The 
E-F scoresof the two groups were demonstrated 
to differ at the p < .001 level (x? = 11.1) by 
the median test (6, pp. 111-116), a non- 
parametric test, the data for which are shown 
in Table 1. 

Altitude change. The central hypothesis of 
this study is that attitude change will occur 
differentially in Groups A, B, and C, and that 
it will occur in the direction which would be 
predicted from knowledge of the group norms 
among Row and non-Row residents in general. 
The 28 Ss of this study had a mean E-F score 
of 102 at the end of their freshman year. The 
data reported above concerning authoritarian- 
ism norms for all women residing on campus 
would lead to the prediction that in general the 
Ss would show a reduction in authoritarianis™ 
during the sophomore year but that this Te 
duction would be differential in the three 
groups; from the knowledge that Row residents 


TABLE 1 
FREQUENCIES oF E-F Scores ABOVE AND 
BELow Common MEDIAN ror Row 
AND Noy-Row RESIDENTS 


Residents Residents 

“Houses” Houses Total 
Above Median 36 166 202 
Below Median 65 137 joe 
Total 101 303,408 
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TABLE 2 
FRESHMAN-YEAR AND SoOPHOMORE-YEAR 
E-F Scores or SUBJECTS 


E-F Score 
End of Fresh- End of Sopho- 
Group man Year more Year Difference 

108 125 ~17 

70 78 —8 

106 107 -1 

92 92 0 

A 80 78 2 
104 102 2 

143 138 5 

110 92 18 

114 80 34 

76 117 —4l 

105 107 -2 

88 82 6 

109 97 12 

98 83 15 

B 112 94 18 
101 82 19 

114 93 21 

104 81 23 

116 91 25 

101 74 27 

121 126 =g 

87 79 8 

105 95 10 

97 81 16 

c 96 78 18 
108 73 35 

114 77 37 

88 49 39 


generally are higher in authoritarianism than 
non-Row residents, the prediction based on 
social group theory would be that Group A 
would show the smallest reduction in author- 
itarianism scores, Group B would show a larger 
reduction, and Group C would show the largest 
reduction. The data which permit a test of this 
hypothesis are given in Table 2. The 
Jonckheere test (3), a nonparametric k- 
sample test which tests the null hypothesis 
that the three groups are from thesame popula- 
tion against the alternative hypothesis that 
they are from different populations which are 
ordered in a specified way, was used with 
these data. By that test, the hypothesis is 
confirmed at the p < .025 level. 


DISCUSSION 


Substantively, the present study provides 
experimental verification of certain assertions 
in social group theory, demonstrating that 
attitude change over time is related to the 
group identification of the person—both his 


membership group identification and his 
reference group identification. The hypothesis 
that extent of attitude change would be differ- 
ent in the three subgroups of Ss, depending on 
their respective membership group and 
reference group identifications, is confirmed at 
the p < .025 level; in extent of change in au- 
thoritarianism, Group A < Group B < Group 
C, as predicted. 

Another way of looking at the data may 
serve to highlight the influence of membership 
groups and reference groups. At the end of the 
freshman year, the Ss in Groups A, B, and C 
shared the same membership group and the 
same reference group. During the sophomore 
year, the Ss in Group A shared one member- 
ship group while those in Groups B and C 
together shared another. From membership 
group theory, it would be predicted that the 
extent of attitude change would be greater 
among the latter Ss. This hypothesis is sup- 
ported by the data (in Table 2): by the Mann- 
Whitney test (6, pp. 116-127), the change scores 
of these two sets of Ss (Group A versus Groups 
B and C together) differ in the predicted 
direction at the p < .025 level. This finding 
illustrates the influence of membership groups 
on attitude change. On the other hand, at the 
conclusion of the sophomore year, the Ss in 
Groups A and B shared a common reference 
group while those in Group C had come to 
share another. From reference group theory, it 
would be predicted that attitude change would 
be more extensive among the subjects who had 
changed reference groups (Group C) than 
among those who had not. This hypothesis is 
also supported by the data (in Table 2): by 
the Mann-Whitney test, the change scores of 
these two sets of Ss (Groups A and B together 
versus Group C) differ in the predicted direc- 
tion at the p < .05 level. This finding illustrates 
the influence of reference groups on attitude 
change. Any inference from this mode of 
analysis (as contrasted with the main analysis 
of the data, by the Jonckheere test) must be 
qualified because of the nonindependence of 
the data on which the two Mann-Whitney 
tests are made, but it is mentioned here to 
clarify the role which membership and reference 
groups play in influencing attitude change. 

The findings may also contribute to our 
understanding of processes affecting attitude 
change. The imposition of a membership 
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group does have some effect on an individual’s 
attitudes, even when the imposed group is not 
accepted by the individual as his reference 
group. This relationship is shown in the case of 
Group B. If the person comes to accept the 
imposed group as his reference group, as was 
the case with the Ss in Group C, then the 
change in his attitudes toward the level of the 
group norm is even more pronounced. 
Methodologically, the study has certain 
features which may deserve brief mention. 
First, the study demonstrates that it is possible 
operationally to define the concept of reference 
group. The act of voting by secret ballot for the 
group in which one would like to live con- 
stitutes clear behavioral specification of one’s 
reference group, and it is an act whose con- 
ceptual meaning can beso directly inferred that 
there is no problem of reliability of judgment 
in its categorization by the investigator. 
Second, the study demonstrates that a field 
study can be conducted which contains the 
critical feature of an experiment that is usually 
lacking in naturalistic situations: randomiza- 
tion. The determination of whether or not a 
woman student would be assigned to the living 
group of her choice was based on a random 
event: the size of the number she drew from 
the hopper. This fact satisfied the requirement 
that the treatment condition be randomized, 
and permitted sharper inferences than can 
usually be drawn from field studies, Third, the 
test behavior on which the conclusions of this 
study were based occurred in a context in 
which the salience of membership and reference 
groups was vot aroused and in which no external 
sanctions from the relevant groups were oper- 
ative. This feature of the design permitted the 
interpretation that the E-F scores represented 
the Ss’ internalized attitudes (4, p. 218), 
Finally, the use of a paper-and-pencil measure 
of attitude and thus of attitude change, rather 
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than the use of some more behavioral measure, 
is a deficiency of the present study. Moreover, 
the measure which was used suffers from a 
well-known circularity, based on the occurrence 
of pseudo-low scores (1, p. 771; 5, pp. 221-222). 


SUMMARY 


In the social context of the lives of the sub- 
jects, and in a natural social experiment which 
provided randomization of the relevant condi- 
tion effects, the influence of both membership 
and reference groups on attitude change was 
assessed. All subjects shared a common reference 
group at the start of the period of the study. 
When divergent membership groups with 
disparate attitude norms were socially imposed 
on the basis of a random event, attitude change 
in the subjects over time was a function of the 
normative attitudes of both imposed member- 
ship groups and the individuals’ reference 
groups. The greatest attitude change occurred 
in subjects who came to take the imposed, 
initially nonpreferred, membership group 45 
their reference group. 
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SOCIOECONOMIC STATUS AND CONCEPTUAL THINKING! 
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LTHOUGH there is a considerable body 
A of data indicating differences between 
socioeconomic groups on scores made 
on commonly used tests of intelligence, the 
meaning and significance of these differences 
is debatable. A better understanding of factors 
relating to socioeconomic group differences in 
mental abilities may be attained if we deal 
with separate phases of the intellective process 
rather than with intelligence in its more global 
and ambiguous sense. The present concern is 
therefore with the ability to conceptualize 
and with its related process, concept for- 
mation. For this purpose, the following 
characterization of concept formation is 
adequate: “Behavior governed by concepts 
requires that the same response shall be made 
to objects of the same class, but a different 
response, or no response at all to objects not 
belonging to that class” (21, p. 609). 

Davis and Havighurst (2, 3) have pointed 
out that status differentiations have the effect 
in varying degrees of defining and limiting 
the developmental environment of the child. 
They, among others, maintain that social class 
is an important factor in determining the 
development of meanings. 

Eells (6), a member of this “culture fair” 
group, purportedly demonstrated the exis- 
tence of a middle-class bias on intelligence 
tests, after an analysis of individual test items. 
Without necessarily accepting Eells’ con- 
clusion about test bias, there are two aspects 
of his data which are pertinent for the present 
study. Eells reported that mean status dif- 
ferences are largest for verbal and smallest for 
nonverbal material, and that the quality of 
abstraciness in the item appeared to produce 
large status differences. All differences were in 
the direction of high status superiority. 

As in intelligence testing, there are virtually 
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no definitive experimental studies relating 
conceptual ability to socioeconomic status. 
Two investigations concern themselves with 
physical causality (4, 10). Both report little 
relationship between the scores on their 
concept tests and socioeconomic status. Each 
study, however, has methodological limi- 
tations, i.e. Huang’s study (10) has many 
uninterpretable responses that may have 
influenced the analysis, while Deutsche’s work 
has been severely criticized by Lacey (12). 
Finally, both are based upon a restricted aspect 
of conceptualization. 

Dixon (5) reports significant differences on a 
test of contradictory relationships between 
children two to six years old when compared 
on the basis of parental occupation (children of 
students and professors vs. working mothers). 
Dixon explains these differences in terms of 
differential intelligence. 

Findlay and McGuire (7), using a variation 
of the Vigotsky test, hypothesized that if 
cultural bias affects test performance, children 
from lower social status backgrounds might be 
expected to exhibit a higher degree of problem 
solving capacity (whatever that may be) than 
children from higher social status backgrounds 
in order to achieve the same range of scores in 
mental test performance as the children with 

obvious cultural advantages. Children from 
HS and LS backgrounds (grade levels 3, 6, and 
9) were equated on IQ as measured by a group 
intelligence test and compared on the sup- 
posedly less culturally weighted conceptual 
test. The hypothesis was not validated, and 
differences were found in the opposite direction. 

The above studies have demonstrated that 
the content of the test item is not the only 
factor in social status group differences. 
Another often mentioned factor is the form of 
symbolism (e.g., verbal or nonverbal) by which 
the item is presented. Eells’ (6) findings in this 
respect have already been mentioned. Tyler 
(19) demonstrated that when Eells’ data were 
converted into standard score form, the mean 
status differences between verbal and non- 
verbal items were no longer significant. 
Havighurst and Janke (8) and McGurk 
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(16, 17) also report negative evidence in this 
respect. These studies indicate that it is by no 
means certain that class differences are more 
marked on verbal than nonverbal tests. 

Conceptual ability in the abstract can only 
be expressed indirectly and through the media 
of certain forms of symbolism (verbal and 
nonverbal). The resultant group differences 
may not be differences in ability to concep- 
tualize, as much as differences in proficiency 
(learned) with the type of symbolism used. The 
role of the supposed middle-class verbal 
superiority must thus be assessed. The initial 
purpose of this study was to determine whether 
status differences on conceptual tests are a 
function of the form of symbolism by means of 
which conceptual ability is being tested. 

It has been noted that low status (LS) 
children tend to operate, as a group, on a 
more concrete level than do high status (HS) 
children (2, 14). Observation of the time 
perspective of LS and HS children indicates 
that the former are more restricted in this 
respect (13). Time perspective and sense of 
abstractness should be closely related, since a 
broad time perspective, by definition, cor- 
responds with an abstract mode of response 
and a narrow time Perspective with an im- 
mediate or concrete mode. This observation 
suggests that social Status differences on tests of 
cognitive ability might also be a function of 
actual cognitive differences, 

The present study therefore attempts an 
evaluation of the importance of the 
variables of form of symbolism and abstract- 
ness-concreteness for status differ 
tests of conceptual ability, 

The following hypotheses have been pro- 
posed as generalizations from findings in the area 
of intelligence testing to concept formation, 
and are tested in the present investigation, 

Hypothesis 1. HS children score more 
highly than LS children on all tests of con- 
ceptual ability. 

Hypothesis 2. The form of symbolism, 
verbal or nonverbal, of the conceptual test is a 
factor contributing to the status differences, 
Greater status differences are predicted on 
verbal material than on nonverbal material, 

Hypothesis 3. When alternative answers are 
available which are alike in correctness but 
differ along the concrete-abstract dimension, 
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HS children more frequently select abstract 
answers than do LS children. 

The foregoing hypotheses pertain to direct 
comparisons of the two groups wherein the 
subjects are representative of their particular 
status. The following hypotheses pertain to 
comparisons in which subjects from each group 
are matched on the basis of a specified variable. 

Hypothesis 4. When subjects from the two 
groups are matched on the basis of nonverbal 
scores, HS children score more highly than 
LS children on tests of verbal concepts dealing 
with the same kind of conceptualization as the 
nonverbal tests, 

Hypotheses 5. When subjects from the two 
groups are matched on the basis of scores on 
verbal concept tests, LS children score more 
highly than HS children on tests of nonverbal 
concepts dealing with the same kind of con- 
ceptualization as the verbal tests. 

Hypotheses 4 and 5 are directed, in part, 
toward the problem of the hierarchical de- 
velopment of concepts as discussed by 
Heidbreder (9). Tt is assumed in Hypothesis 4 
that LS and HS children equated on non- 
verbal material will draw apart, to the ad- 
vantage of the latter, when verbal material is 
introduced. An explanation of the status 
differences in terms of verbal facility (Hypoth- 
esis 2) would not be contributive to the issue 0 
innate group differences vs. differential and 
unequal group experiences. The latter po- 
sition would argue that LS individuals simply 
have not had the opportunity to achieve 
verbal skills. Advocates of innate group 
differences might advance an alternate expla- 
nation in terms of hierarchical development. 
They might argue that verbal superiority is # 
natural outgrowth of greater intellectual 
ability in that verbal material is less “thing 
like” than nonverbal material and therefore 
requires a higher order intellectual process. 

„„ Hypothesis 5 is a more direct check on the 
hierarchy” Principle. If one assumes, as di 
Findlay and McGuire (7), that greater overal 
ability is required by the LS child to do as wel! 
as the HS child on tests of verbal concept 
then, when nonverbal material is introduce% 

the LS child should be at an advantage. 

Validation of the two hypotheses would serve 
to rule out hierarchical order of concept 
attainment as a factor in this study. Failure to 


hes 


SOCIOECONOMIC STATUS AND CONCEPTUAL THINKING 


do so, however, would not serve as evidence for 
either position. 


METHOD 


Subjects 


The Ss were 181 white sixth-grade New York City 
public school children whose average age was about 
eleven. They were divided into two groups, HS and LS. 
There were 99 in the HS group and 82 in the LS group. 
Only white middle-class children in a middle-class school 
and white lower-class children in a lower-class school 
were used. In all, ten classes from four schools were 
tested, 

The average IQ of the two groups differed signifi- 
cantly at well beyond the .001 level. The intelligence 
test was the Otis QSMA, Form M. Summary data are 
presented in Table 1. Table 1 shows also that the HS 
group has a mean reading score of 8.03 on the Metro- 
politan Achievement Elementary Reading Test, a 
Score two years greater than the mean score of the LS 
group, although the LS group is approximately at the 
reading level appropriate for its grade. 


Determination of Status 


Status was primarily determined by free lunch data 
and a questionnaire filled out by the parents. The ques- 
tionnaire, a modification of the Warner ISC (20), ob- 
tained information about education and occupation of 
both parents, number of people in the apartment and 
dwelling area. The 181 Ss finally used were from a larger 
pool of 309. The others were eliminated because of 
undetermined status, non-white, etc. 


Procedure 

Each S was given the complete battery of 
tests, unless absent from one of the testing 
sessions. All of the tests were group pencil-and- 
paper examinations with generous time limits. 
None of the sessions exceeded sixty minutes. 
Each class was visited three times for testing. 


Description of Tests? 


Classification tests. Two such tests were 
given: one verbal (VC) and the other non- 
verbal (NVC). The NVC test was part of the 
Lorge-Thorndike Intelligence Tests-Level 3, 
Form A (14). Three objects are shown to the 
left of the page. The S must pick one of the 
five objects to the right which best goes with 


2? Two additional tests were used in the original study 
(18): one was intended to be a nonverbal analogue of 
the Abstract-Concrete Similarities Test and the other 
a measure of status-oriented attitudes. The former 
proved to be technically inadequate and the latter is 
not relevant to the immediate problem. Discussion of 
these tests is therefore omitted here but may be found 
in the dissertation. 
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TABLE 1 
READING AND IQ CHARACTERISTICS OF SAMPLE 
Variable N Mean SD Range t 
1Q 
High status 99 118.31 14,08 86-149 ayes 
Low status 78 102.65 17.82 69-142 
Reading ` 
High status 99 8.03 2.06 3.4116 pi yas 
Low status 76 5.97 1.97 3.0-1.6 7 


** Significant beyond the .01 level. 


the first three. The standardized time of nine 
minutes was allowed for the 25 items. 

The VC test was specially constructed by 
the Æ from items reported in the literature and 
judged to be appropriate for the sixth grade. 
Attention was given to “culture fair” criti- 
cisms of item construction. The task of the S 
was to select the word that was most different 
from the remaining three. Ten minutes were 
allowed for the 35 items. 

Analogies tesis. Two analogies tests were 
given: one verbal (VA) and the other non- 
verbal (NVA). The NVA test was also part of 
the Lorge-Thorndike Intelligence Tests-Level 
3, Form A (14). The ten minutes allowed for 
the 30 items was one minute more than the 
standard time allowance. The VA test was 
specially constructed by the E in a manner 
similar to the VC test. Testing time for the 35 
items was 10/15”. 

The scoring for the classification and 
analogies tests was simply the number of 
correct answers. 

Abstract-Concrete Similarities (A-C Sim.) A 
number of recent studies have successfully 
revealed qualitative differences on vocabulary 
tests. The procedure, in multiple-choice tests, 
has been to have all of the choices correct but 
ranging in quality from concrete to abstract. 
The task of the S is to pick what he believes to 
be the best answer of the ones offered. This 
procedure has served to demonstrate develop- 
mental differences (11) as well as normal- 
schizophrenic differences (1). The present 
administration is an attempt to extend its use 
to a single age group with the independent 
variable being social status. 

The work of Chodorkoff and Mussen (1) 
served as a model for a specially constructed, 
objective vocabulary test suitable for sixth 
graders. The test, consisting of 35 words, was 
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to determine group preferences in type of 
definition selected. For each word, definitions 
in terms of class, description, example, func- 
tion, and incorrect choice were constructed. 
The order of definitions was nonsystematic. A 
sample item follows: Dog: (a) it has four feet, 
a tail, and fur (description); (b) it means a pet 
dog (example); (c) it’s a small animal (class); 
(d) something wooden (incorrect); (e) it barks 
and guards your house (function). No time 
` limit was set. Each protocol was given five 
scores corresponding to the number of times 
that each type of definition was used. 


RESULTS 
Status Comparison: Unmatched Groups 


Hypothesis 1. Testing of the first hypothesis 
involved the NVA, NVC, VA, and VC tests. 
Status comparisons were made by means of 
the Median test. Distribution free methods 
were used because most of the distributions 
were significantly skewed and remained so 
despite attempts at transformations. The 
skewness appears to be a result of excessive 
time limits. 

The HS children did better than the LS 
children on all of the tests of conceptual 
ability, three of the four tests showing sig- 
nificant differences. Hypothesis 1 was thus 
substantiated. Summary data for the status 
comparisons are presented in Table 2. 

Hypothesis 2. The hypothesis and accom- 
panying prediction were borne out. As can be 

` noted in Table 2, there were clearly significant 
differences between the status groups on the 
verbal tests. Only the NVC of the two non- 
verbal tests was found to have a median 
difference between groups great enough to be 
statistically significant at the .05 level, al- 
though the NVA was close to significance, 
The absolute point differences between the 


TABLE 2 


NONPARAMETRIC MEDIAN Tests or DIFFERENCES 
BETWEEN THE HS Anp LS Groups on THE NVC, 
NVA, VA, anp VC Tests 


Status Group 


Test with Larger Score 2” 
VC High -025-.05 
Sue High -05 ~.10 
VC High <.001 
VA High .025-.05 


* Median test values were corrected for continuity. 
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groups on the verbal tests were greater than 
the point differences on the nonverbal tests. 
Direct comparison, however, was not feasible 
as conversion to standard score units was not 
possible. However, findings discussed below 1m 
relation to the matched groups also support 
the hypothesis. To anticipate, no group dif- 
ferences were found when verbal scores were 
matched and comparisons made on nonverbal 
material, but, when the reverse was done, there 
were significant differences on the compared 
verbal material. 

Hypothesis 3. Testing the third hypothesis 

involved the determination of which group 
selected more of each type of definition on the 
A-C Sim. test. Table 3 presents the group 
totals for each type of definition. Chi square 
was computed to test the hypothesis that 
choice of definition is independent of status. 
Since the chi square for 4 df was 102.05, sig- 
nificant at well beyond the .001 level, the 
hypothesis of independence could be rejected 
with confidence, 
_ Examination of the contribution of each cell 
indicated that the class, example, and incorrect 
categories were solely responsible for the 
Significant chi square. The actual difference, in 
terms of number of class responses, is small 
but Statistically significant. There appears to 
be a greater proneness to use abstractions 
among the HS group than among the LS 
group. 

A separate analysis of the importance of 
reading ability for choice of definition indi- 
cated that the two-year difference in grouP 
reading scores was not sufficient to explain the 
Status differences. Proneness to respond in an 
abstract fashion was still more characteristic 
of the HS group than of the LS group even 
when the most inferior readers of the 
group were removed, 


The significant difference should not obscure 


TABLE 3 3 
E Low anp Hic Sratus GROUP 
ERCENTAGE Seiectine Eacu Tyre OF 
EFINITION 
(Total Number of Responses: HS = 3311, LS = 2642 
crcentages Derived for Each Group Separately’) 


COMPARISON OF TH 
N P 


Status |Descrip- Func- | Ex- a tal 
: tion | Class Gen atte en aa 
igh | 38.0 | 33.6 | 23 2 | 99° 
Low | 36:4 | 27:9 | 93°6| 9:2 | 3.0 | 100-2 
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the close similarity between the responses of 
the two groups. The type of choice is dis- 
tributed in the same order with both groups 
having as an order of preference, description, 
class, function, example, and finally incorrect. 
It is questionable whether the group difference 
on abstraction is socially as well as statistically 
significant. 


Stalus Comparisons: Matched Groups 


Data bearing on Hypotheses 4 and 5 were 
obtained by matching members of each group 
on the basis of score on one form of symbol- 
ism and comparing them on the test of the 
same function using the other form of sym- 
bolism. Four such matchings were made using 
the classification and analogies tests. Analysis 
was by means of the Sign test, corrected for 
continuity. i 

Hypothesis 4. Two nonverbal matchings were 
made. NVC scores were equated and VC 
scores compared, and NVA scores equated and 
VA scores compared. Both matchings showed 
a significant difference in favor of the HS 
group. Hypothesis 4 was confirmed. The data 
are summarized in Table 4. 

Hypothesis 5. Two verbal matchings were 
made. VC scores were equated and NVC scores 
compared, and VA scores were equated and 
NVA scores compared. The data, as shown in 
Table 4, indicate that the hypothesis was not 
confirmed by either of the matchings. It was 
not possible to assess the importance of order 
of attainment. 

Excluded cases. The number of matchings 
for each test was always less than the total 
number of cases involved for each of the tests. 
Analysis of the equated groups indicated that 
they were not representative of their status 
levels. They were composed primarily of the 
lower and middle range of scores of the HS 
population and the middle and higher range of 
scores of the LS population. It was possible to 
equate many Ss in the upper range of scores; 
however, the HS Ss in this range could not be 
completely matched by the LS Ss since there 
were more of the former in the sample. LS and 
HS Ss could not be matched on the lower 
range of scores to any appreciable degree 
because there were very few HS Ss who scored 
very low on the various tests while there were 
many LS Ss who did so. The excluded Ss 
represent a one-tailed extreme for each of the 


TABLE 4 
NONPARAMETRIC SIGN Tests OF DIFFERENCES 
BETWEEN MATCHED GROUPS ON THE NVC, NVA, 
VA, anp VC Tests 


Test Test 7 ee 
Matched Compared N HsS* LSe Ties  p* 
Nvc VC 68 45 22 .007 
NVA VA 60 39 21 .028 


1 

0 
vc NVC 56 22 32 2 = 
VA NVA 55 28 27 0 


* Sign test values were corrected for continuity. 
® Number of times member of that status exceeds matched 
mate. 


status groups, the implications of which fact 
will be considered below. 


Status Comparisons Matched for I nielligence 


Fifty Ss from each group were closely 
matched for intelligence. The mean IQ for the 
LS group was 111.96 and 112.00 for the HS 
group. All 28 of the excluded LS Ss had IQs 
below 100. All but one of the excluded HS 
Ss had IQs over 100. Again, we note that the 
equated groups were not unbiased samples of 
their parent population. There were no sig- 
nificant differences between the equated 
groups when comparisons by means of the 
Sign test were made on the NVC, VC, NVA, 
and VA tests. Unlike the findings of Findlay 
and McGuire (10) who used IQs ranging only 
from 90-110, the direction of differences did 
not favor either status. 


Discussion 


So far, emphasis has been on defining the 
functional importance of the variables, form of 
symbolism and proneness to use abstractions, 
for a generalized consideration of concep- 
tualization. This approach is in contrast to the 
few previous studies in the area which have 
generally considered restricted aspects of 
conceptualization. 

Examination of the “one-tailed status 
extremes” excluded in matching facilitated 
insight into the meaning and possible basis of 
the status differences. The excluded LS Ss, it 
will be remembered, were from the low range 
of scores in the LS population and the ex- 
cluded HS Ss from the high range of scores in 
the HS population. Only because of the ap- 
parently strong relationship between verbal 
performance and status were significant 
differences found for some of the equated 
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groups. On the other hand, when reading 
score was equated, as was done on the A-C 
Sim. test, and, in effect, only the middle range 
of Ss compared, no over-all status differences 
were found in type of definition. Again, there 
were no significant differences when the groups 
were based on IQ. It was obvious from an 
inspection of the frequency distributions in all 
of these instances that important segments of 
each of the status groups (the one-tailed 
Status extremes) and most particularly the 
LS, were not entering into the group com- 
parisons. 

A fraction of the LS group consistently 
scored low on most or all of the tests. Place- 
ment in the bottom quartile on three or more 
tests was arbitrarily fixed as the criterion for 
terming an S a “low scorer.” There were 19 
such individuals of the 82 in the LS group. 
There was no subgroup Corresponding to the 
“low scorers” in the HS Population although 
about a half-dozen individuals did score 
consistently low on more than two of the tests. 
However, even these individuals had scores 
which generally were higher than that of 
many of the LS “low scorers,” 

Having isolated this “low scorer” 
was possible to evaluate their role in 
duction of the differences between the HS and 
LS groups. Statistical comparison of the 
groups, with only the “low scorers” omitted, 
yield similar medians, distributions, ete, 
Comparison of the frequency distributions 
clearly indicates that the Statistically sig- 

nificant group differences arose much more out 
of the infrequency of low scoring HS Ss than 
from the lack of high scoring LS Ss. For the 
most part, the two status samples are very 
similar in test performance, and the status 
differences are a function of a relatively small 
number of Ss in the LS group. 
On the basis of the above analyses, it appears 
to be misleading to refer to general status 
differences in conceptual ability, even though 
such differences can be statistically derived. A 
more accurate statement of the situation as it 
appears from the present study would be that 
when one considers status differences in con- 
ceptual ability there is a greater possibility of 
finding Ss with low ability in an LS group 
than in an HS group but that, for the most 
part, performance of LS and HS Ss is similar. 
Why should there be a prevalence of “low 


group it 
the pro- 
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Scorers” in the LS sample and inferentially in 
the LS population? It is possible that an 
answer to this question may shed new light in 
the recurrent controversy over the source of 
Status differences in cognitive ability. Al- 
though the study was not designed to provide 
such an answer, it does suggest a hypothesis 
which might contribute to an answer: that 
the “low scorers” might be distinguished from 
the other LS Ss in that the former come from 
families that are not oriented in terms of up- 
ward social mobility while the latter come 
from upwardly mobile homes. As a 
concomitant of this attitude toward social 
mobility we might expect greater psychological 
impoverishment in the daily existence of 
“low scorers” than in the other LS Ss, Status 
in the present study was defined in objective 
sociological terms and not as the families 
considered themselves. It js likely that Ss 
from upwardly mobile LS families have psy- 
chological environments more similar to HS 
families than to sociologically determined LS 
families who are not so oriented. 

Social immobility, of course, is not the 
exclusive or even necessarily the most im- 
portant factor to consider in reference to the 
“low scorer” group. There unquestionably are 
some children in the group with inherently low 
intellectual endowment, as there are others who 
have been retarded by experiential impoverish- 
ment. Other factors to be evaluated are pre- 
natal environment, nutritive deficiencies, and, 
in this particular sample, a greater proportion 
of first generation Americans in the LS than 
in the HS group. 

It is obvious that the necessary data for 
explaining the basis for status differences will 
not be derived from the psychometric type 0f 
study (such as this one) which ends by showing 
differences, when an explanatory study shoul 
begin from this point. However, if other studies 
show effects similar to that of the “low scorer 
group in influencing the results and if pro- 
vision is made to analyze the characteristics ° 
these Ss and their families, the meaning ° 
average status differences may be clarified. 


SUMMARY 
This Study was designed to relate aoa 
economic status to conceptual ability. m 
basic problem was to evaluate the importano 
of (a) the form of symbolism by means 
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which conceptual ability was being tested and 
(b) the abstractness of the material for status 
differences in performance. 

A measure of status attitudes and four con- 
ceptual tests were administered to groups of 
99 high status and 82 low status white sixth- 
grade public school children. 

The major results were as follows: 

1. As groups, HS children did better than 
LS children on all tests of conceptual ability, 
particularly those involving verbal material. 

2. The HS group selected more definitions of 
an abstract type than did the LS group. 

3. The status groups were matched for a 
number of variables such as verbal score, 
nonverbal score, and intelligence, and then 
compared on a number of other variables. 
Generally there were no significant status 
differences. This procedure was used to de- 
termine whether the hierarchical order of 
concept attainment principle could be ruled out 
as a factor in HS verbal superiority. The 
results were inconclusive. 

4. The matchings usually excluded low 
scoring Ss in the LS group and high scoring 
Ss from the HS group. When the relative 
small number of lowest scorers in the LS group 
was removed from the group comparisons 
there were no status differences. The meaning 
of general group status differences was ques- 
tioned and further study of the “low scorer” 
group strongly suggested. 
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RECALL OF INTERRUPTED TASKS UNDER STRESS: A PHENOMENON 
OF MEMORY OR OF LEARNING?! 


ALBERT J. CARON AND 
Harvard 


ERHAPS no experimental approach to 
P repression has received more attention 
than the attempt to administer Zeigar- 

nik’s interrupted-task procedure (22) under 
ego-involving conditions. The rationale under- 
lying this use of the Zeigarnik technique held 
that ego involvement would cause incompleted 
and completed tasks to be viewed as failures 
and successes respectively, and hence, in 
accordance with repression theory, would lead 
to the expulsion of incompleted items from 
consciousness. The prediction followed, there- 
fore, that the typical “Zeigarnik” recall 
pattern (superior recall of incompleted tasks) 
would be replaced by a “repressive” recall 
pattern (superior recall of completed tasks). 
Since this view was first advanced, much 
evidence has accumulated (1, 2, 3, 7, 8, 18, 19) 
which indicates, not surprisingly, that such a 
reversal of the Zeigarnik ratio is not a function 
of stress alone, but depends, in addition, on 
personality factors—some individuals recalling 
a preponderance of successes and others yield- 
ing extreme Zeigarnik ratios under ego-in- 
volving conditions. While the isolation of the 
particular personality dimensions associated 
with each of these stress recall patterns offers 
a fertile field for personality research, the 
mechanisms assumed to underlie each pattern 
—namely, repression of failures in the case of 
selective recall of successes (S-recall) and 
superior trace stabilization of failures in the 
case of selective recall of failures (F-recall)— 
are still open to question. As others (15, 20) 
have observed, finding a selective recall 
preference for success items under stress is not 
definitive proof that failure items were re- 
pressed. An alternative explanation, selective 
learning in favor of success items, has not been 
ruled out. Likewise, a selective recall preference 
for failure items under stress may arise not 
from enhanced stabilization of their traces but 
rather from their being better learned in the 


1 This research was conducted while the senior author 
held a U.S.P.H.S. postdoctoral research fellowship at 
Harvard University. Grateful acknowledgment is made 
to Dr. Jerome S. Bruner for making available the 
facilities of his laboratory. 
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first place. The obvious experimental solution 
here—to equate successes and failures for 
original learning—is hard to achieve, for if 
failure items are really threatening, then the 
same factors which theoretically lead to their 
repression (or trace stabilization) may also be 
expected to impair (or improve) their learning. 

The aim in the present study was to provide 
a more feasible test of the memory versus 
learning explanations of both these recall 
patterns. The approach employed follows from 
an analysis of the assumptions underlying 
interpretation of each pattern as a memory 
or learning effect. Consider first the alternative 
assumptions with respect to S-recall under 
stress. 

As has been noted (4, 23), the essential 
postulate of repression theory is that threaten- 
ing events are not actually forgotten but 
persist in an unconscious state, continuously 
striving to regain consciousness. Moreovel: 
should the threatening character of represse 
events be allayed (via psychotherapy or othe 
means), they should re-emerge into conscious- 
ness, The selective learning position, on the 
other hand, implies no such restoration fof 
forgotten items: it assumes that decreased r°- 
call results entirely from a deficiency in origina 
registration. The two interpretations thus 0 er 
different predictions for the recall of events 
whose threatening nature is allayed after 
learning—the repression view implying ©?” 
hanced recall, the ‘selective learning vie” 
implying no change in recall, for such items- 
In the present study, two groups (both know? 
to exhibit S-recall under stress) were subjecte 
to a stressful interrupted-task procedure. A 
one group, however, recall was preceded by 3 
quasi-therapeutic session in which the test "a 
exposed as an experimental hoax. If the 
recall pattern is indeed due to repressio™ 
such exposure, by reducing associated anxiety? 
should restore the recall of incompleted ite” 
and hence cause this group to shift towa™ 
Zeigarnik pattern, If, on the other har’ 
selective learning underlies S-recall, the? 
such shift should occur. ex 

With respect to the F-recall pattern, 

ain 
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planation in terms of selective learning is much 
like that offered above for S-recall. Recall is 
said to be totally dependent on learning and, 
whatever the reason may be, failure items 
have registered more strongly than success 
items. The interpretation in terms of selective 
remembering, on the other hand, is based, 
according to Rosenzweig’s recent formulation 
(17), on the task-tension theory of Lewin (12) 
and Zeigarnik (22). According to this view, 
the traces of failure items are better stabilized 
than those of success items, because the former 
are laid down in a region of high psychic 
tension—tension aroused by instructions to 
perform a task and dischargeable only by its 
completion. Rosenzweig has called this a 
need-persistive” reaction to failure as con- 
trasted with the “ego-defensive” reaction of 
repression. This position predicts that any- 
thing done after learning to reduce task tension 
should bring about a reduction in the recall 
of failure items. The selective learning view, 
on the other hand, implies no recall change as 
a consequence of such post-learning manipu- 
lation. To test these predictions, two other 
groups—both known to exhibit F-recall under 
stress—were subjected to the same stressful 
situation described above, and recall for one 
group again followed exposure of the experi- 
mental hoax. If the F-recall pattern is indeed 
due to superior trace stabilization of failures, 
the revelation that incompleted tasks were 
actually impossible to solve should serve to 
discharge any residual tension associated with 
incompletion and thereby tend to reduce the 
recall of failures. No such reduction in recall 
of failures should occur, on the other hand, if 
this effect is due to differential learning. 

A final question examined in the present 
study may now be mentioned. If our results 
should indicate that both the S- and F-recall 
patterns are a function of learning rather than 
retention, does this mean that there is no 
evidence in the present experiment for the 
operation of mnemonic processes? This ques- 
tion is prompted by consideration of an aspect 
of the present kind of procedure which dis- 
tinguishes it from the original Zeigarnik 
technique, a difference heretofore neglected by 
most writers. To construct a situation suffi- 
ciently threatening for college students, the 
experimental tasks have usually been presented 
in the guise of an intelligence test and indeed 
are so presented here. In so doing, however, 
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the procedure is given a unity and integration 
which is not present in the typical Zeigarnik 
situation, where S is merely asked to perform 
a series of disparate and ostensibly unrelated 
tasks. Under intelligence-test conditions, then, 
it is entirely possible that S’s defensive reac- 
tion is oriented more toward the test as a 
whole, i.e., toward the single experience of 
examination failure, rather than toward par- 
ticular items within the test, and hence 
selective recall of individual items may not be 
a measure optimally sensitive to S’s invoked 
defense. Memory for the entire test situation 
would seem more pertinent, but for obvious 
reasons such a measure is unfeasible. To de- 
termine whether there was indeed a mnemonic 
reaction to the total test situation in this 
study, an alternative indicator in the area of 
perception was employed. The rationale for 
its use is as follows: If S tends to repress an 
experience, aspects of that experience should 
be relatively inaccessible as hypotheses for 
perceptual recognition. Likewise, if the trace 
of an experience is well stabilized for S, aspects 
of it should be hyperaccessible as recognition 
hypotheses. Thus, following the stress situa- 
tion, words related to it ina general way were 
presented tachistoscopically (along with 
matched neutral words). A repressive reaction 
to the entire situation should result in height- 
ened recognition thresholds for such words: 
enhanced trace stabilization, on the other 
hand, in lowered thresholds. Moreover, if these 
effects are truly memorial, and not due to 
registration processes, they should cease to 
occur when exposure of the experimental hoax 
precedes the perceptual test. 


METHOD 
Subjecis 


Since none of the personality or performance meas- 
ures previously shown to relate to selective recall under 
stress have been conclusively validated, it was not 
considered appropriate to use any one or combination 
of them as a basis for preselecting the S-recallers and 
F-recallers in our various treatment groups. As an 
alternative to preselection, it was decided to administer 
a large battery of promising measures? to randomly 
selected Ss already tested in our experimental condi- 
tions, to factor analyze these measures, and, from the 
correlations between the resulting factors and the 


2 The measures used included tests related rationally 
as well as experimentally to each of three personality 
areas: achievement motivation, hysteria, and ego 
strength. 
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selective recall scores of our stress group, to isolate a 
stable predictor that could be used post hoc to subdivide 
all treatment groups. This analysis yielded four orthog- 
onal factors, only one of which was at all related to 
recall direction in the stress condition. The measure 
with the highest loading in this factor (and, incidentally, 
with negligible loading in the other factors) was the 
dichotomous variable of private prep school versus 
public high school attendance, and indeed this single 
variable was more strongly associated with our recall 
measure? than any other variable or combination of 
variables defining this factor (see Footnote 4 for the 
major variables), Thus, the public-private dichotomy 
involved fewer dimensions than one might at first have 
supposed, and also accounted for most of the factorial 
relationship with recall direction. These considerations, 
coupled with considerable evidence suggesting that the 
reactions of Harvard public and private school Ss 
should be quite different in an intelligence-test situa- 
tion," led us to use this variable to subdivide our groups 
into S-recaller and F-recaller components. 

The Ss were chosen in the following way: Invitations 
to participate in the experiment were mailed out to 
every tenth name in the list of Harvard freshmen. The 
list was gone through twice in this manner, starting at 
a different name each time. The letter attractively de- 
scribed the experiment as a study of “symbolic proc- 
esses,” and also emphasized the remuneration for par- 
ticipation. About half of the men to whom letters were 
mailed volunteered—125 in all, of whom 117 com- 
pleted the experiment—and they were randomly 
assigned to the several experimental conditions. The 
number of public and private school Ss in each condi- 
tion was roughly equivalent. 


Sequence of Procedures 


The order of procedures for all Ss was as 
follows: (a) initial digit recall test (4 minutes) ; 


* Chi square was significant at less than the .025 
level with a one-tail test—use of a one-tail test being 
justified since recall directions were predicted from the 
findings discussed in Footnote 4. 

*McArthur’s studies (13, 14) of the personalities of 
Harvard public and private school boys, for instance, 
indicate that the former tend to have a stronger drive 
for achievement and to be more sure of their personal 
and intellectual worth than the latter, Research by 
Atkinson (3) provides further support for this view, 
Ss with high need for achievement recalled more failures 
under stress and more successes under neutral condi- 
tions, while Ss with low need for achievement did the 
opposite—results which directly parallel our recall] 
findings for public and private school Ss respectively, 
Finally, we may consider the performance of public and 
private school boys on some of our independent meas- 
ures. The public school students made significantly 
higher grades at Harvard than those from private 
schools (p < .02 with a two-tail test), although the two 
groups did not differ in intelligence. Moreover, on the 
factor mentioned above, public school attendance was 
associated with such achievement variables as level of 
aspiration, while private school attendance was related 
to such indications of personal insecurity as the Knut- 
son Personal Security Inventory (11) and the Social 
Introversion scale of the Guilford STDCR battery (10) 
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(4) scrambled sentence tasks (50 minutes); 
(c) second digit recall test (4 minutes); (d) 
interim discussion with (8-10 minutes); 
(e) initial recall of scrambled sentences (5 
minutes); (f) determination of perceptual 
recognition thresholds (60 minutes); (g) second 
recall of scrambled sentences (5 minutes); 
(4) third recall of scrambled sentences, two 
days later (5 minutes), 


Tests 


Digit recall. Half of the Wechsler-Bellevue Digit 
Recall Test (21) was administered before, and half after, 
the scrambled sentence tasks, one instance of each 
forward and backward couplet being presented in each 
administration. All responses were written from left to 
right, and writing never began until Z finished reading 
the digits. The number of digit sequences completely 
correct provided the metric. Scores for the first hall 
alone constituted a check on the equality of memory 
ability for the several experimental groups; the differ- 
ence between scores for the first and second halves 
provided a measure of the anxiety induced by the inter- 
vening work on scrambled sentences,’ 

Scrambled sentences. The experimental tasks were 
sixteen 20-word sentences similar to ones used by Alpet 
(1), Eriksen (6), and Caron (5). Each sentence Was 
divided into 8 to 10 two- and three-word phrases an 
was presented in scrambled form, S’s job being to Te 
arrange them into a meaningful sentence. Half of the 
Sentences, randomly distributed in the series of sixteen% 
were unsolvable. Contents of the sentences were diver- 
sified so that recall of one would be no aid to recall © 
another, and the completed and incompleted sentences 
were equated for recall difficulty in a previous study [see 
Caron (5)]. To facilitate group administration, i 
Sentences were presented in book let form, one sentence 
to a page. A 214-min, time limit was set for each solt 
tion and all Ss were required to start a new sentence ^ 
the same time. As an aid to recall, cach sentence ia 
named by selecting one phrase that best representee i 
meaning and Placing this Phrase above the scramblee 
Sentence in the test booklet. Æ called out this “name 
Phrase at the start of each problem. For both initi® 
and later recall tests, S had no prior knowledge ae 
recall would be requested. In recalling sentences, S T 
required to write the name of a sentence or any other 
tey phrase or word sufficient to identify it wit 
n five minutes being allowed a 
each recall session. The measure of recall direction is a 
the number of Completed minus the number of inc? 

Perceptual materials. Immediately after initial recall, 
perceptual recognition thresholds were determine ple 

ing i general way to the ae 
utral words of matched struc’ 


i u 
tested; the corresponding Pe 


wor a š 
5 were Sunshine, cookies, sketches, agric 


= — diets 
Tor the use of digit recall as a measure of an*" 
See Rapaport et al, (16). 


+ eee ee 


RECALL OF INTERRUPTED TASKS UNDER STRESS 


leisurely, channeled, and regard. Ss, tested in groups of 
four, were seated five feet from a ground glass screen ina 
room illuminated solely by a 40-watt fluorescent ceiling 
bulb one foot behind S. Each word, no more than 14 
inches in length, was flashed at }400-second, beginning 
at subthreshold brightness levels and rising in two-volt 
steps until all Ss recognized the word. Threshold was 
taken as the voltage level at which correct report first 
occurred. The order of threatening and neutral words 
was random, with matched words always being seven 
positions apart. To prevent communication between 
Ss, responses were written, E observing these and tell- 
ing each S to stop and turn over his paper when he had 
written the correct word. 


Conditions 


Stress. The stress condition was structured with two 
objectives in mind: (a) to convince S that he was taking 
a highly valid and discriminating intelligence test, and 
(b) to make him feel that he had done very poorly on 
it. With respect to the former objective, a number of 
techniques were used: In the first place, the scrambled 
sentence task was introduced as a test recently devel- 
oped by the “Stanford people” in connection with a top 
level government project, a brief comment being added 
on the relationship of the Stanford people to the Stan- 
ford-Binet test. Second, Æ implied that the government 
project involved careful selection of personnel for criti- 
cal intelligence work, and hence that the present test 
had been designed to provide precise discrimination in 
the average to superior range of adult intelligence. 
Third, bogus validity data were presented specifying 
academic and professional success as criteria. Finally, 
the title “Stanford Reasoning Test” was printed on the 
cover of the test booklet, and beneath that were printed 
“name,” “age,” and “IQ.” (IQ was to be filled in, if 
known. Accomplices, discussed below, always filled in 
high IQs.) That an intelligence test would be adminis- 
tered in a study ostensibly concerned with “symbolic 
processes” was rationalized on the ground that E wished 
to compare the symbolic performances of average and 
highly intelligent students. 

"The following means were used to make S feel that 
he had done badly on the test: First, E stated that most 
Harvard students should perform very well on this test, 
since they were undoubtedly at the high end of the 
distribution of adult intelligence. Second, Ss were 
tested in groups of six, and each such group included 
two accomplices who pretended to solve all the tasks 
rapidly and exuded an air of confidence throughout the 
session. Finally, the four naive Ss received alternate 
forms of the test such that when any two of them failed 
to complete a task the other two Ss completed that same 
task, the result being that for any single task, four of 
the six testees achieved solution while two did not. 

After completion of the exam, the second digit recall 
test was administered, and was immediately followed 
by a period of 8-10 minutes during the first part of 
which E made remarks intended to reactivate feelings 
of stress (referring to the exam’s highly discriminative 
nature, the fact that scores would be made available, 
etc.), and during the remainder of which he scheduled 
Ss for subsequent sessions. Such scheduling was done 
at this point in order to make the interim period for the 
stress condition as long as that for the relief condition, 
and yet prevent Ss in the stress condition from discuss- 
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ing the exam among themselves. Half of the stress Ss 
were kept under stress for all three recall sessions, while 
the other half were relieved just prior to the second 
recall test. 

Relief. This condition was exactly the same as the 
stress condition, except for the contents of the 8-10- 
minute interim period mentioned above. Rather than 
reactivate feelings of stress during this period, Æ at- 
tempted to relieve such feelings. He said the following: 
“Everybody sit back and relax. From the looks on your 
faces I gather that this was a pretty rough test—but 
perhaps you didn’t do as badly as you think. You'll 
hate me when I tell you this, but believe me, it was all 
in the interest of science...” E then explained that the 
entire situation had been a hoax—that half the tasks 
had been unsolvable, that the two apparent geniuses 
in the group had really been E's accomplices, and so on. 
This revelation released a flood of affect—best described 
as a mixture of surprise and tremendous reliei—which 
was inevitably followed by a round-robin of descrip- 
tions of one’s feelings and reactions during the session. 
The subsequent airing of these feelings gave the session 
a definite therapeutic tone. 

Neutral. In contrast to the stress condition, the aim 
of the neutral condition was to create as informal and 
relaxed an atmosphere as possible, and to focus atten- 
tion on the tasks rather than on S’s performance. To 
this end, the scrambled sentence tasks were introduced 
as a pretest of materials for a future experiment. The 
fact that the tasks were timed was rationalized by tell- 
ing S that the purpose of the pretest was to eliminate 
tasks requiring too much time, E supposedly being 
interested in tasks that took no longer than about two 
minutes to solve. Moreover, whereas the timing for the 
stress condition was done in a test-like manner with Æ 
holding his stop watch conspicuously, the timing in the 
neutral condition was done unobtrusively and casually. 
The booklet of scrambled sentences, in addition, had 
no title on it, and S was told that he need not supply his 
name. Finally, no accomplices were present and all Ss 
had the same form of the test, so that everyone com- 
pleted, or failed to complete, any given task. The 
8-10-minute interim period between the second digit 
recall test and initial recall of the scrambled sentences 
was spent scheduling Ss for further sessions and having 
them fill out parts of some questionnaires. 


RESULTS 


Before considering the results which bear 
directly on our major questions, let us examine 
the data pertaining to a basic assumption of 
the study; namely, that the cognitive effects 
obtained in our stress condition were due to 
the ego-involving operations themselves, rather 
than to something inherent in the test materials 
or to subject differences alone. Comparison of 
the stress and neutral conditions for each of 
the subject groups indicates that this as- 
sumption is entirely tenable: Private and 
public school groups yielded significant dif- 
ferences (p < 01 in each case) between stress 
and neutral conditions on both the recall 
difference (Table 1, Column 2) and the per- 


376 


TABLE 1 
mai Dicir RECALL (D), SCRAMBLED SEN- 
vE RECALL SCORES, AND PERCEPTUAL RECOG- 
NITION SCORES FOR S-RECALLERS (PRIVATE 
Scoot Ss) AND F-RECALLERS (PUBLIC 
SCHOOL Ss) UNDER THREE EXPERI- 
MENTAL CONDITIONS® 


Condition D CR-IR|TR| CR} IR | Pb 
S-recallers 
(Private Schoo] Ss) 
Stress 9.5 +1.1 | 5.5) 3.3] 2.21+6.0 
(N = 19) 
Relief 9.3 +0.6 | 4.4) 2.5! 1.9 10.9 
(N = 19) 
Neutral 8.6 —0.2 | 6.4) 3.1] 3.3/1.7 
(N = 15) 
F-recallers 
(Public School Ss) 
Stress 9.0 +0.1 | 5.3) 2.7] 2.61-2.6 
(N = 23) 
Relief 10.2 | 40.1 | 5.7] 2.9] 2.8/0.7 
(N = 23) 
~~ Neutral 9.4 +1.2 | 6.8! 4.0) 2.8!-43.8 
(N = 18) 


* The scrambled sentence recall and perceptual recognition 
measures include mean difference between number of completed 
and incompleted tasks recalled (CR-IR), mean total number of 
tasks recalled (TR), mean number of completed (CR) and incom- 
pleted (IR) tasks recalled, and mean 
ence for threat-related and matched 

PA positive difference means higher thresholds for thre; 
related than for neutral words. The brightness difference in y 
is twice the size of cach entry, 


at- 
olts 


ceptual threshold scores (Table 1, Column 6), 
Moreover, the directions across recall and 
perception were completely consistent for 
each group, private school Ss exhibiting S- 
recall as well as “repressive” perceptual 
patterns under stress but just the opposite 
tendencies under neutral conditions, public 
school Ss doing exactly the reverse, (The inter- 
action Fs are significant beyond the .01 level 
for each variable. See Table 2.)57 


5 Lack of a strong recall tendency in favor of incom- 
pleted tasks in any of our subgroups indicates the 
presence of a general recall bias in favor of completed 
tasks. Thus, when the term “F-recall Pattern” is em- 
ployed in the present study, it is being used ina relative, 
not an absolute, sense. That this term is nevertheless 
being used appropriately, i.e., to specify a type of cog- 
nitive reaction, is indicated by the finding, noted earlier, 
that our group differences directly parallel those of 
Atkinson (3) for high and low need achievers tested 

under similar conditions. 
7 In all our analyses of variance, the form of the test, 
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For both private and public school Ss, total 
recall drops significantly from neutral to stress 
conditions (Table 1, Column 3). However, 
while for private school Ss this is due entirely 
to a decline in recall of failures (Column 
5)° for public school Ss it is completely de- 
termined by a decline in recall of successes 
(Column 4). These losses assume added im- 
port when one compares the groups on our 
measure of recall ability—digit-retention. 
Column 1 of Table 1 shows the mean pretest 
digit-retention scores for the subgroups. The 
analysis of variance of these scores yielded no 
significant or near significant Fs (Table 2). 
Consider now the findings that indicate 
whether these losses are due to mnemonic or 
learning processes, 

1. Is the S-recall pattern a function of re- 
pression of failures or Selective learning in favor 
of successes? The critical datum here is the 
degree of shift in the recall difference score 
from stress to relief conditions for private 
school boys (‘Table 1, Column 2). Although @ 
slight shift occurred, it was far from sig- 
nificant, £ being less than 1. Recall of incom- 
pleted tasks (Table 1, Column 5) shows @ 
nonsignificant decrease from stress to relie 
conditions, whereas, if a repression effect were 
Operating and given the mnemonic compara- 
bility of our groups, one would expect; 
anything, an increase (memory for failures 
being restored by the relief treatment). Recall 
of completed tasks (Column 4) and total 
recall (Column 3) also exhibit nonsignificant 
declines. In Sum, then, no immediate reca 
shift appeared in the relief group. To 
determine, on the other hand, whether 2 
delayed shift occurred for this group of fOr 
other groups; namely, for stress Ss who were 
Subsequently relieved, and for nonrelieve 
Stress Ss (the last as a function of time laps® 
alone), we turn to the data of Table 3 (Cok 
umns 1-6). None of the recall difference? 
recorded there (for CR-IR, CR alone, oF I 
which 


1e., which particular tasks were completed and | nt 
Wore incompleted, was found to be a nonsignifie 
Source, and hence its variance was added to the wit 
cells variance, (9): 
ê These results are similar to those of Glixm: a as 
Whereas Glixman, however, regards such finding 
indicative of a repression process, we are sug 


that they do not Point unequivocally to such 4 
clusion, 
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TABLE 2 
161T RECALL (D), SCRAMBLED SENTENCE RECALL SCORES, AND PERCEPTUAL 
LERS (PRIVATE SCHOOL Ss) AND F-RECALLERS (PUBLIC SCHOOL Ss) UNDER 
‘THREE EXPERIMENTAL CONDITIONS 


ANALYSES OF VARIANCE OF INITIAL D 
RECOGNITION SCORES FOR S-RECALI 


€ D CR-IR TR CR IR P 
Source 
df | ms F ms | F | ms | F | ms | F | ms F ms | F 
Type of Ss 1 | 3.86 0.76 5.80 0.56 2.62 | 1.19 |111.342.15 
Conditions 2 | 5.78 | 1.16 | 0.96 23.25 |3.82*| 7.46 |3.50*| 4.84 | 2.20 | 13.12 
Ss X Conditions 2 | 6.32 | 1.27 |13.57 |5.19*| 5.50 5.38 |2.53 | 4.55 | 2.07 |458.50/8.84* 
Error 110 | 4.99 2.61 6.09 2.13 2.20 51.87 


* For 2 and 110 df, an F of 3.09 is significant at the .05 level; an F of 4.81 is significant at the .01 level. 


TABLE 3 
MEAN DIFFERENCE BETWEEN First AND SECOND RECALL SESSIONS, AND BETWEEN First AND THIRD RECALL 
Srsstons, FOR CR-IR, CR, anp IR 


F-recallers (Public School Ss) 


Condi- | Condi- S-recallers (Private School Ss) 
tion of | tion of : f 5 i; 
Initial | Delayed} 2nd-1st sessions 3rd-1st sessions 2nd-1st sessions 3rd-Ist sessions 
Recall | Recall 
CR-IR |CR/IR| CR-IR | CR| IR CR-IR |CR|IR| CR-IR | CR} IR 
fe | 
Stress stress —0.1 0.2) 0.3) —0.5 0.0) 0.5} —0.7 0.2) 0.9 1 0.2 0.1 
T (N = 13) (N = 12) (N = 10) (N = 10) 
stri i 5 0.0) 0.0} —1.2 —1.0| 0.2 0.0 0.3) 0.3 0.4 0.3|—0.1 
= | eg W = 6) W = 13) W = 13) 
i = = z ‘il 
relief elie 0.1 | 0.3/0.2) —0.2 |—0.2} 0.0) —0.3 | 0.0) 0.3) —0.3 0.2) 0 
i selee hoig ois (N = 16) (N = 23) (N = 23) 
neutra a 2. 0.4} 0.2 0.2 0.1/—0.1) —0.5 0.0) 0.5 —0.6 —0.2) 0.4 
eutral neutral uw 0; 15) (N = 15) (N = 18) (N = 18) 


: . s, 9 
alone) are significantly different from zero, 


hence implying no delayed restoration of 
forgotten items. The present findings thus 
do not support predictions derived from a 
repression interpretation of the S-recall pat- 
tern, but are consistent with those derived 
from a selective learning interpretation. 

2. Is the F-recall pattern a function of se- 
lective remembering or selective learning in favor 
of failures? To answer this question, we must 
examine the degree of shift in the recall dif- 
ference score from stress to relief conditions for 
public school Ss (Table 1, Column 2). No such 
shift occurred, the two means being identical. 
Recall of incompleted tasks alone, moreover, 
shows a nonsignificant increase, whereas the 
task-tension theory would predict, if anything, 
a decrease. Recall of completed tasks and total 


® As can be seen from Columns 5 and 6, the CR-IR 
difference of —1.2 (p < .075) is a function of decreased 
recall of completed tasks rather than increased recall 


of incompleted tasks. 


recall also show negligible increases. Here 
again, then, no immediate recall shift occurred 
in the relief condition. Further, there was no 
delayed shift for any of the subgroups. As 
indicated in the last six columns of Table 3, 
none of the differences for CR-IR, CR alone, 
or IR alone, were significantly different from 
zero. Our findings, therefore, do not confirm 
predictions derived from a task-tension inter- 
pretation of the F-recall pattern, but once more 
are consistent with those implied by a selec- 
tive learning view. 

3. Is there a mnemonic reaction to the test as a 
whole? The data relevant to this question are 
the threshold shifts from stress to relief 
conditions for test-related words (compared to 
matched neutral words). According to Table 
1, Column 6, private school Ss in the relief 
group show a significant return (p < .05, 
two-tail test) to the level obtained under 
neutral conditions, whereas public school Ss do 
not shift significantly. That the perceptual 
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defense pattern dissipated for private school 
Ss when they were relieved, suggests that a 
genuinely repressive reaction—a memory 
rather than a learning phenomenon—occurred 
for these Ss with regard to the threatening 
situation as a whole. Since the perceptual 
vigilance reaction of the public school Ss, on 
the other hand, did not dissipate in the relief 
condition, it would seem that this reaction is 
due to a learning rather than a memorial 
process, 


Discussion 


Three propositions summarize the present 

results. 1. Both the S- and F-recall tendencies in 
the present study are due to a selective learning 
rather than a selective remembering mechanism, 
The pertinent datum here is the following: 
Among the boys from private schools, i.e., 
those exhibiting an S-recall pattern, the tend- 
ency to recall failures was no greater for 
relieved than for nonrelieved Ss; likewise, 
among the boys from public schools, i.e., those 
exhibiting an F-recall pattern, the tendency to 
recall failures was no less for relieved than for 
nonrelieved Ss. 2. A repression mechanism was 
found for S-recallers, but with respect to the 
stress situation as a whole rather than for 
specifically failed items. The evidence here: 
Nonrelieved private school boys manifested 
perceptual defense for words related in a 
general way to the examination situation, 
whereas those who were relieved showed no 
such perceptual defense. 3. Instead of @ com- 
parable process of enhanced retention of the 
total stress situation, F-recallers showed en- 
hanced registration for the total situation, Thus, 
although public school boys under stress were 
perceptually vigilant for test-related words 
(relative to public school boys under neutral 
conditions), this effect did not dissipate in the 
relief condition. 

Before we can accept these statements, two 
critical procedural issues require reexamina- 
tion: namely, (a) was the stress situation 
sufficiently threatening; and (b) if so, was the 
relief treatment really therapeutic? Consider 
first the question of threat. 

Needless to say, the present data would have 
no bearing on a theory of repression if the 
stress situation had not induced intense ego 
threat. That such threat did indeed occur may 
be inferred from several lines of evidence— 
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observational, introspective, and experi- 
mental. In the first place, Ss in the stress 
condition showed many overt signs of emo- 
tional disturbance (little or none of which 
appeared among the neutral Ss). When 
stumped by a problem, they would squirm 
nervously or shake their heads in disgust 
(sometimes swearing simultaneously) or would 
stare at the page with a very pained expres- 
sion. Further, there were many attempts to 
conceal one’s difficulty from others, either by 
pretending to have finished problems not 
actually completed, or, when able to solve a 
problem, by trying to make one’s success very 
obvious to competitors (e.g., putting one’s 
pencil down with a bang or yawning loudly 
while stretching). Again, depressive and 
hostile reactions were quite prevalent follow- 
ing the exam, many Ss sitting morosely in 
their chairs, others openly challenging the 
worth of the test. Lastly, expressions of great 
relief—deep sighs, postural relaxation, laugh- 
ter—almost always appeared when the rusë 
was finally explained. As to introspective 
evidence: First, most Ss voluntarily reporte 
that they had been extremely concerned prior 
to, and tremendously relieved following 
exposure of the hoax. They attributed emo- 
tions to themselves ranging all the way from 
intense feelings of inadequacy (“I said z 
myself, ‘See, you really don’t belong @ 
Harvard’ ”) to violent hostility (“I wanted 9 
get that supercilious bastard sitting beside 
me.”) Secondly, continued questioning o 
vealed that although some thought the test l 
poor one, no one had doubted its reality. wit : 
regard to experimental data, finally, consider 
the results for our indicator of anxiety: dig! 
recall. Whereas recall of digits showed s 
marked pretest to posttest improvement a 
Ss in the neutral condition, Ss under ane 
showed no such improvement, the F ain 
Conditions being significant well beyond os 
01 level. The most plausible explanation | 
the latter Ss’ deficiency would seem to be a 
disrupting effects of anxiety generated a 
the scrambled sentence test. These gana i 
sum, would seem to support the assumpti e 
that intense ego threat was aroused i? 
present study. . Can 
Let us turn now to the second question: ga 
exposure of the hoax really be consideren ? 
therapeutic, i.e., anxiety-reducing, techmd 
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Even if it were conceded that anxiety had been 
aroused in our stress condition, the present 
findings could be legitimately challenged on the 
ground that this anxiety was not effectively 
reduced in the relief condition. Such a challenge 
might proceed as follows: In the typical 
clinical setting, the therapist’s efforts to un- 
cover repressed events meet with strong 
defensive resistance. Hence, recovery of re- 
pressed material requires a great deal of time 
and involves a special kind of intimacy 
between patient and therapist, neither of 
Which occurred in the present relief treatment. 
The point is well taken, but it overlooks two 
important differences between repression as it 
ordinarily occurs and repression as precipi- 
tated in the present experiment. First, in 
ordinary circumstances, the fearful event is 
unknown to the therapist, having occurred 
before therapist and patient began their 
relationship. Thus, as long as the therapist is 
ignorant of the actual situation and cannot 
confront the patient with reality, the patient 
may safely continue to hide behind his defense 
and deny the event. In the present situation, 
however, the individual is faced with an 
omniscient therapist—one who produced the 
threatening event and who is as much an 
authority on its nature as the patient. To 
continue repressing in the face of such omnis- 
cience is to render one’s contact with reality 
precarious at best. Second, and perhaps more 
critical, there would seem to be but two ways 
in which a threatening event may be made less 
fearful: (a) by effecting a radical change in 
the patient’s motivational structure (eg. 
eliminating his fear of intellectual incom- 
petence) so that the event (e.g., exam failure) 
no longer poses a threat; or (b) by altering the 
significance of the event so that it is no longer 
motivationally relevant. The former is the 
proper goal of the therapist in the usual 
clinical setting. The latter is what was ob- 
viously involved in the revelation of the 
experimental hoax to our Ss: the failure 
experience had initially been created by de- 
ception and was now being restored to its true 
meaning as a psychological gimmick. E’s 
revelation does not make S any less of a 
repressor or eliminate his problem with respect 
to intellectual competence (nor was it intended 
to), but merely renders this particular event 
irrelevant to that problem and hence no 
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longer something to be defended against. In 
the light of both these points; then, the 
assumption that effective therapy occurred in 
the present experiment seems entirely tenable. 

Two general conclusions follow from the 
present findings. One is methodological, and 
touches on the question of criteria for a suit- 
able test of repression. The other concerns the 
particular mechanisms that underlie the 
avoidant and adient cognitive reactions ob- 
tained here. Consider each of these in turn. 

1. In attempting to provide an experimental 
demonstration of repression, the general 
practice has been to compare the recall of 
certain “nonthreatening” aspects of the 
experimental situation with that for certain 
threatening aspects. In particular, it has been 
common to examine the recall of failures in 
contrast to successes on a bogus intelligence 
test. The present results impute this approach, 
for they suggest that repression embraces only 
the most general aspects of the stress situa- 
tion, whereas recall of particular threatening 
or “nonthreatening” aspects of that situation 
is a function of learning. This finding seems 
entirely reasonable, for when a person has 
failed half the items on a test, what is likely to 
be threatening for him is not that he has 
failed those particular items but that he has 
failed the fest. Nor can the items he has passed 
be entirely free from threat if the total situa- 
tion in which they are embedded arouses 
anxiety. It would thus seem necessary, in 
experimental research on repression, to 
determine precisely what category of materials 
is relevant to the subject’s conflicts and fears. 
While recent experiments in this area have 
been careful to satisfy the previously neglected 
criterion of motivational relevance of experi 
mental materials, the present findings strongly 
suggest it would also be well to consider the 
cognitive relevance of these materials, 

2. That both public and private school Ss 
showed opposite and significant differences 
between stress and neutral conditions, and 
that the directions of these differences were 
completely consistent across recall and per- 
ception, indicate that two quite different 
defensive reactions to stress occurred in this 
study. One—an avoidant reaction by private 
school Ss—was indeed shown to be repressive 
as far as the total test situation was concerned 
but, with regard to particular items, was 4 


380 


matter of registration differences. The other— 
an adient reaction by public school Ss—was a 
case of registration differences at both levels. 

ne may properly inquire at this point what 
the particular processes might be which 
determine the various registration differences 
found. Osgood (15), among others, has sug- 
gested that differential rehearsal may account 
for such learning superiorities. The possibility 
of covert review of failures by Harvard public 
school Ss indeed seems especially reasonable on 
the assumption that these Ss are high 
achievers. While such individuals are not un- 
concerned about failure, they undoubtedly 
have a history of repeated reward for efforts to 
counteract initial failure. Hence these Ss 
would be expected to ruminate over their 
difficulties as a cognitive precursor to con- 
structive adjustment. In the case of private 
school Ss, covert review of successes might be 
expected as a kind of ego-bolstering reaction. 
Another possibility for these Ss is that anxiety 
over inability to solve items acts to disrupt 
their registration. These two mechanisms 
are, of course, by no means mutually ex- 
clusive. 


SUMMARY 


The present study proposed to answer three 
questions: 1. Is superior recall of successes in 
an intelligence test situation (S-recall) a 
function of selective forgetting (repression) of 
failures or selective learning in favor of suc- 
cesses? 2, Is superior recall of failures in this 
situation (F-recall) a function of selective 
remembering or selective learning in favor of 
failures? 3. Is there a mnemonic reaction to the 
test as a whole? The first two questions were 
examined by comparing the recall preference 
of nonrelieved S- and F-recallers on a bogus 
intelligence test (involving solvable and un- 
solvable scrambled sentences) with that for 
comparable relieved Ss, selective remembering 
theories predicting a shift from nonrelieved to 
relieved conditions for each recall group. The 
third question was examined by comparing the 
perceptual recognition thresholds of the above 
relieved and nonrelieved Ss on exam-related 
and matched neutral words. Forty-two Ss each 
were tested in the relief and nonrelief con- 
dition, with 33 more tested under neutral 
conditions. Prior attendance at a private 
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versus a public high school was used as the 
indicator of S- and F-recallers respectively. 
The findings were as follows: 1. Both the 
S- and F-recall tendencies were due to a 
selective learning rather than a selective ka 
membering mechanism. 2. A repression 
mechanism was demonstrated for S-recallers 
with regard to the stress situation as a 
whole. 3. A comparable process of enhanced 
retention of the total stress situation was not 
demonstrated for F-recallers; rather, the effect 
here was due to enhanced registration. Two 
conclusions were drawn from these findings: 
First, that repression was found for the total 
test situation rather than for specific items, 
suggests experimental tests of repression must 
fulfill a criterion of cognitive relevance of test 
materials. Second, the kinds of enhanced 
registration found may have been due tO 
covert rehearsal, and possibly also to disruption 
of the registration of other items by anxiety. 
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GROUP CLIMATE AS A FACT 


OR IN THE RELEVANCE AND 


ACCURACY OF SOCIAL PERCEPTION! 


RALPH V. EXLINE 
Fels Group Dynamics Center, University of Delaware 


HIS PAPER reports on a study of accuracy 

of social perception as a function of the 

| “climate” of social interaction, Special 

attention is paid to the relevance of the per- 

ceived, estimated, or judged social behavior of 
others to the goals of the perceiver. 

Much of the research on accuracy of social 
perception has been concerned with accuracy 
as an attribute of the individual perceiver (3, 
8, 12). Results of certain of these studies, how- 
ever, suggest the merit of studying the rela- 
tionship between accuracy of perception and 
the social context in which perception is 
measured. Conrad (5) and Chowdhry and 
Newcomb (4), for example, suggest that ac- 
curacy is a function of the relevance of the 
perceptual task to the motives of the perceiver, 
while Suchman (22) speculated that group dif- 
ferences in accuracy of task-oriented perception 
are caused by the nature of the interaction 
within the groups studied. 

The theoretical relationship often assumed 
to exist between motivation and perception 
suggests that the study of perceptual accuracy 
in a social context may be facilitated by con- 
sidering perception in terms of a path-goal 
relationship. Thus, according to Lewinian 
theory (16), once a person defines a goal, he 
attempts to move from a position outside toa 
position within the goal region. Ina relatively 
unstructured situation where a goal, but not 
the means to achieveit, has been defined, move- 
ment toward the goal requires that he direct 
energy to the differentiation of a path leading 
toward the goal. This thinking can be gen- 
eralized to group situations and seems espe- 
cially pertinent where information concerning 
the opinion of other group members is a neces- 
sary part of the path region (and where the 
opinion of others can be learned, or inferred, 
from their behavior). In these cases, persons 

can be expected to direct energy to the under- 
standing of relevant opinions. Hence, every- 


1 This report is based on a dissertation submitted in 
partial fulfillment of the requirements for the Ph.D. 
degree at the University of Illinois. 


thing else being equal, they should be more 
likely to be aware of and to perceive accurately 
those behaviors of others which they define as 
relevant to locomotion toward the goal. 

In the light of research and theory (7, 16, 
18) that conceives of the group as providing & 
psychological social environment for its mem- 
bers, the environment created by the group 
may have predictable effects on the likelihood 
that a given behavior is seen as a “path 
element and hence on the accuracy with which 
it is perceived. It is the general hypothesis of 
this study that the group environment, through 
its effect upon what is perceived as relevant to 
the goals of the group members, predictably 
affects the accuracy with which specifie 
social phenomena are perceived, 

One aspect of the psychological social ¢ 
vironment that groups provide their members 
is that of the social “climate” (18) or “hedonie 
tone” (15) that characterizes the interaction 0 
group members. The relationship of this pie 
nomenon to goal-striving activity of mdma 
uals has been widely documented. Rear 
reported by Lippitt and White (18), Dense 
(7), and Hemphill and Westie (15) has oe 
that friendliness and congeniality of i 
group’s “social climate” occurs concomitantly 
with motivation to persist in working towar 
task goals assigned to the group. There is @ os 
evidence to indicate that individuals as anal 
bers of groups often pursue individual, of er 
centered goals, rather than ones oriented tot i 
group task. The social climate of these ran 
groups has been shown to be one of a 
pleasantness (13), more conflict (10), and - S 
friendliness (7) than the climate of ae 
Where members’ goal-orientations are 
individual in nature. t 

The effect of the group’s climate upo” the 
relevance of a given goal, hence upo? oal 
nature of behavior seen as defining the P ap 
path, provides the bridge between the gich 
environment and the accuracy with Thus, 
individuals perceive specified behaviors- jikel¥ 
if members of congenial groups are more 
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to hold a group goal in common, opinions that 
bear on the group goal are more likely to be 
mutually relevant, hence more accurately 
perceived, than in groups of less congenial 
atmosphere. Conversely, if members of groups 
characterized by uncongenial interaction cli- 
mates are more prone to hold individual, ego- 
centered goals, then hypothetically, opinions 
held and communicated by others that are 
relevant to one’s own ego-centered goals should 
be more relevant, hence more accurately 
perceived. 

On the assumption that task-oriented ac- 
tivity is perceived to be more relevant in con- 
genial than in noncongenial group climates, 
while ego-oriented activity is perceived to be 
more relevant in noncongenial than in congenial 
group climates, the discussion above may be 
more formally stated in terms of two hypoth- 
eses which this study is designed to test. 

Hypothesis I: Members of congenial groups 
are more accurate than members of noncongenial 
groups in perceiving task-oriented behaviors. 

Hypothesis II: Members of noncongenial 
groups are more accurate than members of con- 
genial groups in perceiving ego-oriented behavior. 


METHOD 


Subjects 


One hundred thirty graduate students assumed to 
be relatively homogeneous as to age, intelligence, and 
socioeconomic status (they were public school teachers) 
were recruited from a summer session at the University 
of Illinois. To minimize the effects of previous ac- 
quaintance, each group was composed of Ss drawn from 
a different section of a basic course in educational 
Psychology. The fact that these students were teachers 
whose regular jobs were in communities scattered 
throughout the state was taken to be an additional 
restriction on acquaintance. 


Design 


The Ss were randomly assigned to two treatments 
by group atmosphere, then subdivided into groups of 
five people of the same sex. The groups were first given 
instructions designed to create a congenial or noncon- 
genial atmosphere. Following these instructions, each 
group worked on a standard discussion task, after 
which the members marked a set of rating scales de- 
signed to measure the accuracy of their perception of 
task-oriented and ego-oriented behaviors of others, 
their satisfaction with the progress of the group, and 
their willingness to continue working with the group. 
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Operational Definitions? 


Congenial groups. Congenial groups were those 
groups which, by a process of random selection, were 
given information designed to create the expectation 
that group members would be congenial to one another. 
At the time of recruitment, Ss were given a standard 
personality test and a questionnaire purported to meas- 
ure attitudes towards working in groups, and told that 
this information would be used to help the investigator 
assign them to discussion groups in which they would 
find the other members congenial. Upon arriving at the 
laboratory, Ss assigned to the congenial treatment were 
told that they stood very high on a scale of congeniality 
developed from the preliminary tests—‘“the matching 
was so good as to be expected only once or twice in all 
the groups studied,” and that they should get on well 
together. 

Noncongenial groups. These groups were given infor- 
mation designed to create the expectation that group 
members would not be congenial to one another. The 
Ss assigned to these groups were told that due to the 
limited number of available subjects, not all groups 
could be matched equally well as to congeniality. They 
were told that they stood very low on the congeniality 
scale, that it was necessary to put people with opposing 
views in the same group, that they were about as bad 
a matching as was possible, and that they probably 
would not find their group very congenial. 

Task. The task was introduced as one that would 
enable psychologists to relate the Ss’ scores on the 
previous tests to their ability to take the role of another 
person. It was implied that psychologists think this 
ability important in establishing good human relation- 
ships with other people. 

The Ss were asked to role-play a group of trainees in 
vocational guidance, meeting to discuss the disposition 
of a vocational counselee. They were to reach agree- 
ment upon a recommendation concerning the disposi- 
tion of the case. Each S was given the identical test 
and biographical information about the hypothetical 
client, but in addition received a unique recommenda- 
tion based on one of five different orientations toward 
the data. Thus, one S suggested a job on the basis of 
an orientation that emphasized the need to consider 
present and future job opportunities. Another S’s sug- 
gestion was based on the relationship between person- 
ality factors and job requirements, a third on the role 
of training and past experience, a fourth on test scores, 
and the fifth on test scores in combination with train- 
ing and past experience. The disagreement that was 
thus ensured was expected to facilitate the production 
of behavior indicating interpersonal preference as well 
as of differential orientation toward the path to task 
completion. i 

A standard amount of time was allowed for Ss to 


2? Complete descriptions of the instructions, results 
by individual Ss and detailed statistical tables can be 
obtained in reference (9) from University Microfilms, 
Ann Arbor, Michigan. 

3 Instructions designed to create congenial and non- 
congenial groups were a modified version of those used 
by Back (1). 


384 


familiarize themselves with the data and the point of 
view they were to support in the discussion. To insure 
that each group had a minimum opportunity to hear 
all points of view, the instructions required each person 
to present his suggestion and opinion in turn. After 
each .S had spoken twice, a general discussion was 
permitted. 

Task-oriented behavior. Task-oriented behavior is de- 
fined conceptually as the manipulation, symbolic or 
otherwise, of the idiosyncratic content of the task. Its 
operational definition, like that of ego-oriented be- 
havior, becomes complex when the individual is a 
member of a problem-solving group, for the definition 
must involve reaction to others’ acts. Action and re- 
action are necessary to the processes of communication, 
evaluation, control, decision, and tension reduction 
which according to Bales (2) characterize the sequence 
of group problem solving. Individuals in such groups 
communicate not only their own orientations to the 
problem, but their evaluations of other’s orientations, 
Both such types of communications qualify as task- 
oriented behaviors in that both are necessary to the 
movement of the group toward its goal of decision. 
On the basis of these considerations, task-oriented be- 
havior is operationally defined for present purposes as 
verbal or nonverbal behavior that communicates either 
the frame of reference used by a person to organize the 
task data, or his reaction to the frames of reference 
used by others to organize the same data. 

Ego-oriented behavior. The individual is said to be 
ego oriented when his goal is the enhancement of self 
without regard to the demands of the task environ- 
ment (17). Ego-oriented behavior refers to the manipu- 
lation of elements in the task situation that bear on the 
self as a value. It is assumed that in the process of 
working through disagreements, Ss communicate infor- 
mation pertinent to their orientation to and evaluation 
of their fellow group members as persons, Ego-oriented 
behavior, then, is operationally defined as verbal or 
nonverbal behavior that communicates a person’s 
relative acceptance of other group members, 

Accuracy of perception of task-oriented behavior. Task- 
oriented perceptual accuracy is defined as the discrep- 
ancy between each S’s rank ordering of the use of four 
orientations toward counseling as he thinks the other 
Ss would rank order them, and the other §’s actual 
rank ordering of the same four orientations, 

An error score for each S was obtained as follows 
Each S was given a booklet containing the definitions 
of the orientations toward counseling and five sets of 
tating scales. Each scale consisted of 20 intervals dig 
tributed over six definition 7 


‘The orientation of the fifth 5 was 


of those of the third and fourth Ss * combination 
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to each orientation by each S$ was obtained by averag- 
ing S’s squared errors over all orientations and others. 

Accuracy scores defined in terms of a rank-order 
metric have the advantage of eliminating the possibility 
that low error scores merely reflect the judges’ willing- 
ness to use extremes, or ability to judge deviations of 
others from a central tendency. Scores derived by this 
method reflect a component of accuracy that Cronbach 
(6) terms SAr, i.e., accuracy in judging the rank order- 
ing of the items made by the group as a “generalize 
other.” Since items correspond to task orientations, 
this type of accuracy is assumed to be relevant to the 
problem studied. 


Accuracy of perception of ego-oriented behavior. EgO- 
oriented accuracy was measured in a manner similar 
to that of task-oriented accuracy. In this case; the $ 
rated the extent to which he liked the others and estl- 
mated the ratings each of the others gave their C0- 
workers. Ratings were made on scales which, but for 
re-worded designation points, were identical with those 
used for task-orientation, An error score was derive 
as described in task-oriented perception, In this case 
however, the other members of the group were tag 
items which Ss rank-ordered. 


RESULTS 


Effectiveness of Manipulations 


To establish that the treatments did indeed 
create the necessary differences in congenialitys 
the extent to which members of the various 
groups liked their co-workers, and their willing” 
ness to continue with the group, were assesse™ 
Members of “congenial” groups marke l 
scale of liking of co-workers significant 
higher than did the members of “nonconge™, , 
groups (means of 16.05, 14.60; SDs, 1.99, 19 

= -02, two tails). Similarly, the “congen!@ 
rated themselves Significantly higher ($ = ° he 
m willingness to continue working with 0) 
group (means of 16.72, 15.19; SDs, 3.26, 392; 
Since Ss assigned to congenial groups show" 
no greater Pre-experimental disposition to won 
with groups than did Ss assigned tO noa 
congenial groups®, the above data are taken r 
evidence that the inductions create H 

esired climates of congeniality. 


5 The con 
genial 
dispositi 
Sponse to an j 
their cl; 


IN groups. 

We 

oe = .94) a 
S i? 2 

‘A tof gongenial groups did not answer the a jev”! 


(two-tailed t 
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TABLE 1 TABLE 2 
MEAN Error Score IN ESTIMATING TASK- AND ANALYSIS OF VARIANCE OF TASK- 
EGO-ORIENTED BEHAVIOR MADE BY SUBJECTS ESTIMATION ERROR OF Smo E 
CATEGORIZED AS TO CONGENIALITY CATEGORIZED BY SEX AND 
ATMENTS AND SEX CONGENIALITY TREATMENT 
Treatment Source of Variation df Mean Square F 
Perception Measure Congenial Noncongenial Sex 1 5.36 85 
z = m = Congeniality treatment 1 41.63 6.62* 
: N x N x Interaction: sex X 1 1.95 31 
= E e iality 
Task-oriented Pith eats 24 29 
Mises @ Ge p goo “Seo ; 8 
Female subjects 32 6.51 32 7.89 * Significant at .05 level or beyond. 
Ego-oriented* 
Male subjects 20 9.55 25 10.11 
Female subjects 30 7.98 19% 8.96 TABLE 3 


® Mean-per-person error is based on five-person groups only. 
The difiiculty of this estimation was not comparable in the ve- and 
four-person groups. 

b One S filled out scales improperly. 


Tests of Hypotheses 


The hypotheses concerning the relationship 
between social climate and the accuracy of 
perception of task and ego-oriented behavior 
of others were tested by applying analysis of 
variance techniques to mean-per-person error 
scores. Although the problem was not formu- 
lated in terms of sex differences, the composi- 
tion of the groups permits this comparison. 

Hypothesis I. Mean-per-person error scores 
on task-oriented perception for congenial and 
noncongenial groups are listed in Table 1 and 
the results of the analysis of variance in Table 
2. The results indicate that members of con- 
congenial groups, regardless of sex, are sig- 
nificantly more accurate than members of non- 
congenial groups in task-oriented perception. 

Hypothesis IT. Mean-per-person error scores 
on ego-oriented perception for congenial and 
noncongenial groups are listed in Table 1 and 
the results of the analysis of variance in Table 
3. An examination of ego-oriented perception 
error scores shows no statistically significant 
differences between congenial and noncongenial 
groups, although significant differences at- 
tributable to sex are readily apparent. 

Hypothesis I is thus confirmed by the data, 
while Hypothesis II is not. Indeed, the data 
indicated a trend, albeit a statistically insig- 
nificant one, in the direction of greater ego- 
oriented accuracy on the part of members of 
congenial groups. An unanticipated but in- 
teresting finding is that women are more 
accurate than men in the estimation of their 
Co-workers’ interpersonal likes and dislikes. 


ANALYSIS OF VARIANCE OF EGO-ORIENTED ESTIMATION 
Error OF SUBJECTS CATEGORIZED BY SEX 
AND CONGENIALITY TREATMENT 


Source of Variation dfe Mean Square F 


Sex 1 43.47 4.90* 

Congeniality treatment 1 14.33 1.62 

Interaction: sex X 1 71 08 
treatment 

Within groups 90 8.87 


* Significant at the .05 level or beyond. 
® Tsao’s (24) approximation method was applied to correct for 
disproportionate subclass n. 


Post Hoc Analyses 


Examination of Ss’ responses to the post- 
meeting questionnaire provide additional data 
concerning the relationship between group cli- 
mate and accuracy in perceiving task and ego- 
oriented behavior. It has already been men- 
tioned that Ss were asked to indicate their 
willingness to continue working with the group 
on similar tasks in order to measure the effect 
of the congeniality inductions. If this measure 
is a valid index of congeniality, Ss more willing 
to continue should perceive task-oriented be- 
havior more accurately than do those who find 
the group less attractive, regardless of experi- 
mental treatment; while, conversely, those less 
willing to continue should perceive ego-oriented 
material more accurately. 

These post hoc hypotheses were tested by first 
categorizing Ss according to whether their 
willingness to work with the same group was 
above or below the median of the pooled scores, 
then applying analysis of variance techniques 
to determine the effects of congeniality induc- 
tions, willingness, and the interaction between 

the two upon the social perception error scores. 
Mean-per-person error scores are listed in 
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TABLE 4 
E IN ESTIMATING TASK- AND 
MEAN ERROR SCORE IN 
EGO-ORIENTED BEHAVIOR MADE BY SUBJECTS 
CATEGORIZED AS TO WILLINGNESS TO 
CONTINUE AND CONGENIALITY 


Treatment 


Congenial 


N X N X 


Perception Measure Noncongenial 


Task-oriented 5 
High willingness 37 
Low willingness 27 

Ego-oriented 
Tigh willingness 29 5 
Low willingness 21 7.89 26 


TABLE 5 
ANALYSIS OF VARIANCE OF MEAN Error Scores 1N 
TASK-ORIENTED PERCEPTION MADE By More 
AND Less WILLING SUBJECTS 


Source of Variation df* Mean Square F 


Willingness 1 27.08  4,593* 

Congeniality treat- 1 31.36  5.318* 
ment 

Interaction: willing- 8.99 1.524 
ness X congeniality 

Within groups 124 6.12 


* Significant at the 05 level or beyond. 


a Tsao’s (24) approximation method was used to correct dis- 
proportionate subclass n, 


Table 4 and the results 
iance in Tables 5 and 6. 

Inspection of the means in Tabl 
that, in both congenial and no 
groups, Ss who were more willi 
with the group made fewer task-oriented errors 
than did their less willing co-workers. It is also 
apparent that in both cong 
genial groups, fewer 


of the analysis of var- 


e 4 shows 


significant at better than the .05 level, the data 
support the hypotheses,® 


5 Although no data concerning the Productivity of 
the groups were collected, it is of interest to note that 
members of congenial groups expressed greater satis- 
faction with the progress of their groups. Mean satis- 
faction ratings, on a 20-interval scale similar to those 
described previously, were noted as follows: Congenial 
male and female, 14.44 and 15.47 Tespectively; non- 
congenial male and female, 12.19 and 11.06 respectively, 
With mean squares of 354.44, .07, 37.19, and 18.17 for 
congeniality, sex, interaction, and error variance, re- 
spectively, the only significant F was that of 19.51 
(significant at the .001 level with df’s of 1 and 124) 
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TABLE 6 

ANALYSIS OF VARIANCE OF MEAN ERROR SCORE IN 

EGO-ORIENTED PERCEPTION MADE BY MORE 
AND Less WILLING SUBJECTS 


Source of Variation df* Mean Square F 


* 

Willingness 1 40.64 4.089 

Congeniality treat- 1 16.38 1.684 
ment 

Interaction: willing- 1 1.41 -014 
ness X congeniality 

Within groups 90 9,94 


* Significant at the .05 level or beyond. dis- 
2 Tsao’s (24) approximation method was used to correct dis 
Proportionate subclass n. 


Discussion 

Results reported above call into question ma 
generality of a theory of accuracy of socia 
perception that postulates accuracy as a func- 
tion of the relevance of the perceived behavior 
to the goals of the Perceivers. On the one hand, 
the finding that task-oriented phenomena are 
more accurately perceived in congenial ape 
atmospheres, where they are assumed to be 
more relevant to the goals of group membe 
would appear to provide support for t 
theory, On the other hand, the theory was n° 
Supported by the finding that ego-oriente 
Phenomena are not more accurately perceive’ 
in the noncongenial group condition, the setting 
where ego-orientation was assumed to be mor 
relevant, of 

It is possible that the operational measur® 5 
ego-oriented accuracy of perception did pat 
Provide a valid test of the theory with referen i 
to such behavior. There is evidence to suggor 
(1, 2, 13) that the mapping of interperson" 
relations is a relevant activity for members 
congenial groups, 

On the other hand, existing theory and gars 
Suggest that regardless of the validity ity 
Operationally defining ego-relevant oe 
as the assessment of interpersonal relation” 
one could not expect such assessment tO ry 
accurately carried out. Newcomb’s (19) es 
of communicative acts, for example, Suge ot 
that accuracy of Perception of others’ behav. 


: ich 
1n a group is a function of the extent to wh 


the 
found for Congeniality. It is uncertain whether k- 
difference noted above is related to the greater S 
oriented accuracy recorded in congenial gron P duc- 
merely reflects the impact of the congeniality re not 
tions. In view of this uncertainty these scores & 
discussed further, 


Group CLIMATE AND ACCURACY OF SOCIAL PERCEPTION 


the behavior can be said to be an object com- 
mon to group members. Thus, while it is 
Possible to conceive of task-oriented behavior as 
an object common to the members of congenial 
groups, it is less easy to see how others’ be- 
havior perceived as relevant to the private, ego- 
oriented goals of one group member can be 
said to be held in common by all group mem- 
bers. Also relevant to this discussion are data 
reported by Tagiuri, Blake, and Bruner (23), 
who indicated Ss could not accurately perceive 
Cues of rejection. To the extent that such cues 
are more likely to occur in noncongenial than 
In congenial groups, a factor favoring greater 
accuracy of perception of interpersonal rela- 
lions exists in congenial groups. 

In spite of the above arguments, there re- 
Mains the fact that interpersonal relations were 
Perceived with greater accuracy by those less 
Willing to continue their association with the 
group. It was suggested, moreover, that this 
result could be interpreted as supporting the 
hypothesized relationship between the con- 
geniality of the group and accuracy of per- 
Ception of ego-relevant behavior. 

An alternate interpretation may explain this 
relationship. This interpretation, which as- 
Sumes that interpersonal conflict occurs in the 
groups, is at variance with the specific deriva- 
tion embodied in Hypothesis IT, but not neces- 
Sarily with the general theoretical position of 
this paper (conflict, after all, is a feature of the 
&roup’s environment). If cues concerning inter- 
Personal relations are more likely to be ob- 
Served in situations of interpersonal conflict, 
and if one can assume that conflict adversely 
affects a member’s willingness to continue with 
the group, it follows that accuracy in percep- 
tion of interpersonal relations and unwilling- 
ness to continue with the group may be con- 
comitant results of conflict. Unfortunately, 
there are no measures of the extent to which 
such conflict marked the process of group 
discussion. The question of the meaning of 
more accurate perception of interpersonal 
relations on the part of those less willing to 
continue with the group thus remains un- 
answered. In view of the speculative, even post 
hoc, nature of the postulated relationships 
among willingness, congeniality, and accuracy 
of perception, it is suggested that generalization 
of the theory herein presented be limited to the 
Perception of task-oriented phenomena. The 
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significant relationships noted between willing- 
ness and the two kinds of accuracy of percep- 
tion, however, do suggest the desirability of 
further exploring the nature of these relation- 
ships, as well as their implications for the 
processes of problem-solving groups. 

Finally, the unanticipated finding that women 
perceive interpersonal relations more ac- 
curately than men appears to merit some 
discussion. This difference is not related to sex 
differences in willingness to work with the 
group, a factor already shown to be inversely 
related to such accuracy, for no differences 
between the willingness of men and women 
were found in either treatment. We must look 
further for an explanation of this phenomenon. 

One explanation of this finding rests on the 
assumption that our culture sanctions freer 
expression of feeling by women than by men. 
It would follow, from this assumption, that 
more women than men would openly express 
their likes and dislikes concerning their co- 
workers. This explanation would appear to be 
related to the theory of sex-role differentiation 
developed by Parsons and Bales (20), and 
studies in jury deliberations by Strodtbeck and 
Mann (21). The results herein reported might 
be interpreted as further and more general 
confirmations of the above theory concerning 


woman’s sex role. 
An alternative explanation might be phrased 


in terms of social power. If the general posi- 
tion of women in our society is one which 
allows them less social power than men, it may 
be important for them to be able to accurately 
assess the feelings, attitudes, and relationships 
among others in order to achieve their personal 
goals. 

Finally, women may be higher in “affiliation 
motivation” than men. If this isso, French and 
Chadwick’s (11) finding that those having high 
affiliation motivation are more accurate in 
perceiving their own popularity status would 
be pertinent to our data. Research designed to 
test the validity of the above speculations is in 
progress. 


SUMMARY 


Twenty-eight groups of five or four graduate 
students each, controlled for sex, education 
and acquaintance were assigned to congenial or 
noncongenial group climates. Analysis of 
variance was used to test the effects of group 
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climates, motivational levels, and the impor- 
tance of sex difference upon perceptions of task 
and ego-oriented behavior. 

The findings were as follows: : 

1. Individuals in congenial groups were sig- 
nificantly more accurate in perceiving task- 
oriented behavior of their group than were 
members of noncongenial groups. 

2. Individuals in noncongenial groups were 
not more accurate in perceiving the inter- 
personal relations in the groups than were in- 
dividuals in congenial groups. 

3. Individuals willing to continue to work 
with the group on a similar task were more 
accurate in perceiving task-oriented behavior 
of their group than were individuals less will- 
ing to continue with the group. 

4. Individuals willing to continue to work with 
the group on a similar task were significantly 
less accurate in perceiving interpersonal rela- 
tions within their group than were individuals 
less willing to continue with the group. 

5. Women, though neither more nor less 
willing to continue with their group, were 
significantly more accurate than men jn 
perceiving interpersonal relations, 

It is concluded that 
predictably affect the 
oriented behavior variables, but that further 


social climates can 
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CRITIQUE AND NOTES 


ON THE PHENOMENA OF “OPPOSITE SPEECH”! 


BERNARD KAPLAN 
Clark University 


T A RECENT case study, Laffal ef al. (16) reported 
a schizophrenic patient exemplifying “an 
unusual language syndrome” which the authors 
refer to as “opposite speech.” This syndrome “con- 
sists basically in the use of ‘yes’ by the patient 
when he means ‘no,’ and vice versa” (16, p. 409). 
On a level closer to the data, the syndrome seems 
to be manifested in the interchange of words of 
contrasting signification; thus, the patient ap- 
peared to interchange “right” and “wrong,” “do” 
and “don’t,” “like” and “hate,” “always” and 
“never,” “any” and “none,” etc. Such opposite 
speech phenomena were manifested in both recep- 
tion and expression, in both oral and written dis- 
course, and in both spontaneous utterances and 
responses to questions and test problems. The 
patient was seemingly unaware of the interchanges; 
furthermore, there was no analogy to this type of 
speech activity on the level of concrete, practical 
action. 

In their discussion, Laffal and his collaborators 
emphasized the different needs of the patient which 
the opposite speech phenomena might have served ; 
they also considered the inadequacies of contiguity- 
reinforcement theory in accounting for these 
phenomena. However, with respect to the nature 
of the syndrome in the light of cognitive develop- 
ment, the authors limited themselves to a few pre- 


‘liminary remarks and did not take up the issue 


subsequently. Confronted by what Arieti has de- 
scribed as the schizophrenic’s “teleological regres- 
sion” (2, p. 192), they dealt predominantly with 
the ostensible teleology and secondarily with the 
nature of the regression. d 

It is the purpose of this paper to consider 
“opposite speech” within the categories of a theory 
of the development of cognition—a theory con- 
cerned with general, formal properties of cognitive 
activity, obtaining in phylogenesis, ontogenesis, 
cultural evolution, psychopathology, etc. (24, 
25, 26, 27). 

The fundamental principle of this comparative- 
developmental approach to cognition is that wher- 
ever development occurs, it proceeds from a rela- 
tively global and undifferentiated state to one of 


1 This paper derives, in part, from research and study 
carried out within a project on normal and pathological 
symbolization and communication. This project, di- 
rected by Heinz Werner and the author, is supported 
bya grant from The Foundations’ Fund for Research 


in Psychiatry. 


increasing differentiation, articulation, and inte- 
gration; on the other hand, where development is 
undone, there is a return to a greater autonomy of 
local functions and structures. These correlative 
principles have the status of heuristic laws (cf. 22); 
though not themselves subject to empirical test, 
they are valuable in the formulation of empirically 
testable generalizations. 

One such empirical generalization is that adult 
schizophrenics, in deviating from normal adult 
modes of organizing experience, move in the 
direction of a relative dedifferentiation of their 
cognitively constructed world, and toward a 
greater autonomy of local functional patterns 
organized on a lower level of integration (e.g., 
toward a greater autonomy of concrete, practical 
activity with respect to linguistically articulated, 
conceptual thought).? 

Insofar as cognitive activity in schizophrenics 
undergoes this dedifferentiation and disintegration, 
it will show similarities in structure (cf. 23, p. 27 ff.) 
to cognitive activity occurring under a variety of 
conditions and among certain groups of nonschizo- 
phrenic individuals. Thus, schizophrenic cognitive 
activity will tend to show formal similarities to the 
activities of normal man (a) under transitory con- 
ditions of stress, anxiety, or distraction; (b) in the 
process of expressing experiences in primitive 
media of symbolization, e.g., dreams, gestures, etc.; 
(c) in earlier stages of sociocultural linguistic 
codifications of experience. Schizophrenic cognitive 
activity may also be expected to show similar 
formal properties to those reflected in the linguistic 
articulation of experiences among (d) very young 
children; and (e) such organically dedifferentiated 
individuals as certain aphasics. 

There follows (a) a critical discussion of the 
findings presented by Laffal et al.; (b) an applica- 
tion of the developmental principle to show why 
“opposite speech” phenomena are regarded as 
involving a more or less primitive cognitive 
activity; (c) a summary of the empirical evidence 


? Dedifierentiation or regression are not here con- 
sidered as processes of delamination. Regression may 
be manifested in a return to certain more primitive 
levels of cognitive activity without entailing the im- 
pairment of “schemata” or “apparatuses” structured 
in the organism in the course of mastering other, genet- 
ically more advanced, operations. Furthermore, regres- 
sion does not here signify a re-evoking of old content, 
but rather a return to more primitive forms of thought 
and patterns of operations (cf. 2, 5). 
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from each of the five above-mentioned S ia 
order to evaluate the thesis that opposite speech’ 
is more characteristic of relatively primitive rom 
tion than of logico-scientific cognition. Finally, 
(d) some of the findings obtained in two recent 
experimental studies on symbolization will be 
discussed in order further to support the main 
thesi: his paper. 
Eo = teenie cited by Lafial as illustra- 
tions of “‘opposite speech,” there is at least the 
possibility that these investigators have subsumed 
under a common rubic essentially dissimilar 
phenomena with respect to cognitive processes. 
Thus, when they state that the patient remarked, 
“Pm hungry,” and then refused to take food from 
his mother, there is at least the possibility that his 
refusal was based on some delusion with respect 
to food coming from his mother (cf, 15, p. 40); the 
refusal does not necessarily warrant the interpreta- 
tion that the patient really meant “Pm not 
hungry.” Again, when a male schizophrenic, to the 
statement, I am a man responds with “no” (writ- 
ten) and “Nope, I’m no man” (oral), it does not 
follow that he is confusing or interchanging “no” 
with “yes”; he may well be deluded with respect 
to his sex. There are several other examples whose 
classification involves the same arbitrariness of 
interpretation, but the above may suffice to ground 
the criticism that several diversely mediated 
Phenomena may well be grouped by Laffal et al, 


Tentiatedness 


for logically opposite 
meanings, e.g., the use of “like” for both “like” 
and “hate,” etc. 


Thus, it seems that “opposite speech” was 
operative when the patient elaborated on “limping 
around” by adding “able to walk” or on “dis. 
gusted” by remarking “when you're feeling good, 
happy and contented.” Again, it seems that 
“opposite speech” was in play when the patient 
completed the sentence What makes me angry... 
with “T feel good, I get very nervous inside, I 
start to like.” 

The underlying confusions between affirmation 
and negation or between opposing predicates seem 
especially clear in those instances when the pa- 
tient appeared to be struggling to clarify his 
thought-expression; for example, when he com- 

pleted the sentence, J feel like cursing when. . . with 
“things go good, or if I have any hard luck, get in 
trouble in any way. Well, if things go right. . . If I 
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have some plans and things come up to break 
it... don’t come up to break it.” es 
The grounds on which developmental theorists 
expect opposite speech phenomena to be more 
characteristic of primitive thought than of rela- 
tively advanced, logico-scientific thought | may 
now be briefly considered. As has been mentioned, 
thought is defined as more primitive the more 
undifferentiated and unintegrated it is. Opposite 
speech is presumed to arise because the content 
to which the linguistic signs refer is in such x 
global, undifferentiated state that the linguistic 
vehicles (seemingly discrete) really share the 
same global referent (3, pp. 182 ff.; 17; p: 18; 19, 
P. 40; 20, pp. 119 ff.). Concretely, a linguistic sign 
like “hate” and a sign like “love” both refer toa 
sphere of “strong affective response” as distin- 
guished from “neutral reaction”; signs like “hot 
and “cold” both refer to a semantic sphere of 
“extreme deviations from normal temperature, 
etc. It is due to the undifferentiated character g 
the referent, then, that two signs or linguistic 
sign complexes, which in mature thought function 
to articulate and differentiate a global semantic 
sphere, in primitive thought are used inter- 
changeably. It follows from this characterization 
that when there is complete and systematic 
exchange of symbols (“love” used always or 
“hate” and vice versa), opposite speech is 2° 
necessarily manifest, Opposite speech is in play 
only where the same sign or sign-complex refe!s 
now correctly, now incorrectly. sad 
Turning now to the five domains mentione” 
above, one may examine the evidence to ae 
mine whether there is warrant in concluding i 
such verbal interchanges or undifferentiated e 
bolizations are more characteristic of the 
domains than of logico-scientific domain. $s 
First, even in normal adults conditions of an 
or anxiety—presumably leading to a dedifferent! 
tion of thought—promote the emergence id 
“opposite speech” phenomena, e.g., “left” is pe 
for “right,” “on” for “off,” etc. Many phenome 
subsumed by Psychoanalysts under the ere 
of “psychopathology of everyday life” are cee 
cases of opposite speech. In this connection per 
remarks, “Numerous observations have taus 


3 It may be noted that in most of the examples Prg 
vided by Laffal ef al., the manifestations of Rene 
speech” may have come about through the GIMS ig 
the patient of “no or “don’t”; in other words, per 
often appears to be an interchange of opposites fying 
rather be a Positive judgment, indifferently sign are 
both an affirmative and a negative judgment. Sa y 
some instances, however, which seem unambiguo™ jen 
involve an interchange of opposites, e.g., the Pp? of 
completes the sentence, “When people make 
me...,” with “I start to like.” 
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me... that we frequently interchange contrasting 
words; they are already associated in our speech 
consciousness; they lie very close together and 
are easily incorrectly evoked” (9, p. 43). 

Second, opposite speech phenomena are char- 
acteristically exhibited in the relation between 
manifest and latent dream contents. Freud, 
summarizing his observations of dreams, remarks: 
“The way in which dreams treat the category of 
contraries and contradictories is highly remark- 
able. It is simply disregarded. ‘No’ scems not to 
exist so far as dreams are concerned. They show a 
particular preference for combining contraries in a 
unity, or for representing them as one and the 
same thing. Dreams feel themselves at liberty, 
Moreover, to represent any element by its wishful 
contrary; so that there is no way of deciding at a 
first glance whether any clement that admits of a 
contrary is present in the dream thoughts as a 
Positive or as a negative” (10, p. 318). If one 
accepts Freud’s inquiries as representative and 
his analyses as correct, it would seem that opposite 
Speech phenomena are characteristic of the dream 
State. 

Third, it has been argued that there is a strong 
tendency toward opposite speech in early forms 
of language. Thus Karl Abel (1, pp. 225 ff.) claimed 
that in the most ancient languages there character- 
istically appears the use of a single verbal symbol 
to represent what for Western adult mentality are 
two contrasting significations, two contraries at 
the extreme ends of a series of qualities or 
activities. Even when distinct terms for the two 
Contraries emerge, their kinship is reflected in the 
fact that the differentiation is achieved by slightly 
modifying the “mother form” to express one of the 
contraries, while the original form is retained for 
the other. Abel illustrates his thesis with reference 
to the Coptic language: In Coptic, kef is used to 
express both “to take” or “to let lie”; ken 
designates “strong” and/or “weak.” The symbols 
for “stand” and “move” are men and menmen, 
respectively—this last example illustrating the 
Primal emergence and representation of more 
Precise determinations of the common, global 
sphere.4 

Research on some contemporary “primitive” 
(cf. 27) languages yield the presence of similar 
phenomena. For example, Westermann, in his 
study of the Ewe language (cited in 6) observes 
that one and the same adverb serves to designate 
both “yesterday” and “tomorrow”; again, in the 


4 According to Abel, the same tendency to express 
contrasting significations by a single symbol character- 
ìzes early stages of the Semitic and Indo-European 
languages. For a more recent linguistic discussion of 
Such undifferentiated, “neutral” symbols, see (13), 
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Shambala language, the same word refers both to 
the earliest time and the distant future.® 

Fourth, in their renowned work, Die Kinder- 
sprache, Clara and William Stern (21) cite many 
examples of the use of a single symbol among 
children for what to normal adults are contrasting 
significations. For example, their daughter, Hilde 
at 1214 months used “papa” to designate both her 
father and mother; at 16 months she used “mama” 
to designate both her mother and father; at 18 
months she interchanged “ja” and “nein” (pp. 
29); at 21 months she confused “winter” and 
“sonne,” and also “kalt” and “heiss” (pp. 35 ff.) 
at two years of age, she would use the adverb of 
place, “zu,” for “zu” and “auf” (pp. 56). 

According to the Sterns, the use of antithetical 
words belongs almost exclusively to the first 
stages of child language. They differentiate a 
small number of groups of words which seem to be 
interchanged with some regularity among the 
children discussed in the literature. One group, for 
example, consists of place adverbs of motion. 
Among German children, the words “an” and 
“auf” are especially interchanged. A second group 
involves expressions of activity. For instance, one 
investigator’s daughter used “bi” both for being 
dressed and being undressed. Another investigator 
observed that his son at 23 months used the word 
“moga” for “I can” and “I can’t,” and also 
“znam” for “I know” and “I don’t know.” A 
third group in which antithetical meanings are 
expressed by a single verbal symbol comprises 
adjectives, very frequently adjectives of tempera- 
ture; the Sterns mention that seven of the children 
discussed in the literature interchanged “heiss” 
and “kalt,” etc. 

Decroly’s (8, p. 234) investigations supplement 
the findings cited by the Sterns and others. In 
studying his child, E., Decroly found many in- 
stances of “opposite speech.” Thus, feme was used 
to express both “fermer” (close) and “ouvrir” 
(open); cho signified both “chaud” (warm) and 
“froid” (cold); lume expressed both “allumer” 
(light up) and “eteindre” (extinguish); pen was 


6 Cassirer, in discussing this lack of differentiation 
between expressions for past and future, observes that 
it derives, in part, from the Primitive intuition of time 
as a thing. A sharp distinction is made early between 
the contents of the now and the not-now, but the not- 
now is not given any further determination (6 p. 220) 

, ê The Chamberlains (7), not cited by the Sterns, 
ive some examples, analogous to those quoted by 
Laffal et al., with respect to their patient’s in : 
of “any” for “no” or “none.” At 31m 
often used “any,” “anything,” or “anybody” in thi 
sense of “none,” “nothing,” “nobody”: thus, she es 


marked, “Any more pickas in zat book” for “No mor 
Pictures... ,” etc. i 


terchange 
onths, their girl 
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“ e” (take) and “donner” (give), 
oF a Pe serve to support the thesis 
ae opposite speech is relatively more character- 
=e of the linguistic activity of younger children 
Gon of older children and adults. ; 

Fifth, students of the language behavior of 
aphasics have often remarked the occurrence of 
henomena subsumable under the concept of 
Veprepgnite speech.” One of the earliest investiga- 
tors, Bastian, cites several cases in which aphasic 
patients could say both “yes” and no,” but 
could not always use these terms appropriately 
(4, pp. 97, 99). Additional examples of such 
linguistic interchanges may be found in the works 
of Head (14), Goldstein (12), and Weisenburg and 
McBride (23). : i 

Head remarks of one of his patients that “he 
could not grasp the exact meaning of ‘to’ and ‘past’ 
(the hour), ‘high’ and ‘low,’ ‘up,’ and ‘down,’ 
‘back’ and ‘front,’ ‘right’ and ‘left? when con- 
trasted with one another. Asked, ‘Would you like 
to stop in or go out?’ he would reply, ‘Either,’ in 
order to hide his want of complete comprehension” 
(14, 1, p. 180). 

One of the patients examined by Weisenburg 
and McBride answered “yes, yes” to all 
when he meant “no” (p. 
employment of opposites, 
to the authors, showed “an incre; 
to lose the controlling idea ‘ 
patient tended to use “onl 


speech” to a relatively | 
opment of linguistic th 


part of the first study, carried out by R. Pollack 
ging in age 
2 years, six 
of college students were 
presented with the task of selecting colors for the 


representation of 20 mood terms (e.g., happy, sad; 
nice, nasty, etc.) so chosen that the total list 
consisted of 10 pairs of opposites, all subsumable 
under two broad classes of positively toned and 
negatively toned words. In general, it was found 
that younger children tended more than older 
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ones to employ the same color to represent what, 
to adults, are more or less antithetical meanings. 
Moreover, when the opposition between terms was 
highlighted by juxtaposing the antonyms, the older 
children and adults seemed to benefit from the 
experimental polarization of opposites by oe 
consistently selecting different colors for the anti- 
thetical words; the younger children showed no 
noticeable improvement. The groups were then 
asked to symbolize the mood terms expressively 
by drawing linear symbols, Only the adults expres- 
sively distinguished Positive and negative mood 
terms by linear schematization when the mood 
terms were presented in random order. When 
presented with a juxtaposition of opposites, the 
youngest group of children alone showed no im- 
Provement in the tendency to diflerentiate the 
Positive and negative opposites. This study points 
to the relative undificrentiatedness among younger 
children of what later are given determination as 
opposites, 

The second experimental study, carried out by 
A. Goldman (11), was concerned with symbolic 
reception and expression among schizophrenics 
contrasted with normal adults. In the first part -i 
the study, the two groups of subjects were et 
domly presented with 20 mood terms to be repr 
sented in line patterns; again the total group 2 
words consisted of 10 pairs of opposites. In rere 
second part of the experiment, the subjects Vno 
presented with line symbols constructed by ath 
experimenter and asked to match the symbols y n- 
the mood terms. Goldman found that schizophroi 
ics, both receptively and expressively, were ri to 
likely than normals to utilize the same amb 
express antithetical meanings or sharply ie 
ing symbols to express the same meanings. than 
a schizophrenic was significantly more likely t ht, 
a normal to take on one occasion a curved, mi 
upward line, and on a second occasion an angu 
dark, downward line as expressive of “joy Ton 
use the light, curved, upward line on one occas aS 
as expressive of “joy” and on a second occasion 
expressive of “sadness.” z and 

The findings of the experiments by Pollack ce 
Goldman, and the general tenor of the evide h 
from the five domains in which primitive thera 
expression might plausibly be thought tog osi 
Suggest that the processes underlying “opP' nics 
speech” occur not only in a few schizo} pe 
employing a Strange means to avoid anxiety vide 
express hostility, but may be found in any 1P ati 
ual, characteristically or momentarily D of 
under conditions conducive to a primitivizat™ i 
the level of symbolic articulation and organ’ vs 
of experience. It is hoped that this discussion . siS 
the value of a Comparative-developmental an^ 
of the formal aspects of cognitive functioning 
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complement to dynamic considerations for a fuller 
understanding of schizophrenic symptomatology 
(cf. 2, Part ITI). 
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A REPLICATION AND 


FRANKLYN N. ARNHOFF 
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I A Previous paper (2), the writer reported an 
tes exploratory study in which ethnocentrism 
task Scores and error scores on 
to of stimulus generalization (SG) were found 
o C positively related (r = 44, N = 25) tending 
Sites support to the hypothesis that ethnocen- 
en m Tepresents an “overgeneralization phenom- 
in on.” During the course of an extensive 
fens of SG, the previous study was 
es icated with refinements and additional con- 

ols, In addition to the visual-spatial task (lights), 
a more cognitive task (blocks) was added to the 


a visual-spatial 


Unit, Dept. of Mental Hygiene, Syracuse, N. Y. 


and both SG tasks were related to 
the subjects’ (Ss’) scores on the California Ethno- 
centrism Scale (E scale) (1). As this study was 
part of a larger study reported elsewhere (3), and 
the two generalization tasks have been previously 
described in detail (2, 3, 4, 5), the reader is referred 
to other sources for descriptions of the tasks and 
the experimental design. 

The volunteer Ss were 60 female student nurses, 
aged 18 through 22. As far as can be determined, 
the sample was comparable to the one in the pre- 


vious study (2). 


present study 
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RESULTS AND Discussion 


Following administration of the two SG tasks, 
Ss were given the E scale and the CVS Intelligence 
Scale. Prior to the study proper, all Ss had been 
administered the group form of the Minnesota 
Multiphasic Personality Inventory (MMPI). 

Generalization was defined as the number of 
errors made on the two tasks. Total error scores 
were computed for each S on each task. These 
were then correlated with Ss’ E scale scores, yield- 
ing 7’s of .10 (lights) and .15 (blocks), values 
falling far short of significance. 

IQ was not found to be a significant factor in 
SG error scores, although its relation to the E 
scale (r = —.31) was consistent with previously 
reported findings (1). 

Test-retest scores on the E scale were found 
reliable (r = .85) and overdefensive responding 
as measured by the MMPI K scale was only 
minimally related to E scale scores (r —.29), 
ruling out these factors as explanation for the 
present failure. 

The previous positive findings must therefore 
probably be attributed to chance. While present 
results do not preclude the possibility that ethno- 
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EDWARDS 


centrism is related to overgeneralization in think- 
ing, the functions involved in performance on the 
tasks investigated do not appear to be on the same 
psychological continuum. 
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OF ENDORSEMENT OF ITEMS IN 
CHECK LIST 


ALLEN L. EDWARDS 
University of Washin glon 


NE oF the most widely used methods for 
O obtaining self-descriptions of personality is 
the personality inventory. As is well known, the 
typical personality inventory consists of a number 
of statements or items to which a person is asked 
to respond by indicating all those that he believes 
characterize himself. His responses to the state- 
ments thus provide his self-description. If a large 
number of individuals are given the same inven- 
tory, it is possible to find the proportion endorsing 
each statement, that is, the proportion who say 
that a given statement is applicable in self-de- 
scription. Each of these proportions may be re- 
garded as the probability that a given statement 
will be answered “Yes,” “Agree,” or “True,” de- 
pending on the form of the inventory. 

It has been found that it is possible to predict 
these probabilities of endorsement with a rather 


1 This paper was presented at the American Psy- 
chological Association meetings, Chicago, Illinois, Sep- 
tember 1, 1956. The study was supported by a grant 
from the Graduate School of the University of Wash- 


ington. 


e 
high degree of accuracy. The predictions a 
possible because the probabilities of endorse™ 
are linearly related to another independe” s 
determined characteristic of the statement® 
namely, their social desirability scale values: aen 
relationship between these two variables has 7 
found consistently for a variety of statem? 
investigated, er- 
The social desirability scale value of a PY, 
sonality statement refers to its location O" 
psychological continuum of social desirability pe 
judging group, for example, is asked to ocial 
comparative judgments of the degree of on 
desirability of each of the statements in a Sè s of 
the basis of these judgments, and by oe ale 
one of the Psychological scaling methods, # 
value is obtained for each statement on a PSY ale 
logical continuum of social desirability. High 
values on this continuum represent oe 
that are judged as socially desirable and il 
values represent statements that are judg 
socially undesirable, 


sen 
r give 
Suppose we scale the statements mn 


ao 


As $ 
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personality inventory for social desirability in 
terms of the judgments of one group of subjects. 
The inventory is then administered to a new and 
independent group, that is, one other than that 
used in determining the social desirability scale 
values, with the usual instructions to provide 
self-descriptions. For this second group, the pro- 
portion endorsing each of the statements in the 
inventory is obtained. It is then possible to de- 
termine the relationship between probability of 
endorsement and social desirability scale value 
for the set of statements. 

Following the procedure just described, Ed- 
wards (1) found a product-moment correlation of 
‘87 between probability of endorsement and 
social desirability scale value for a set of 140 
personality statements. This finding has since 
been substantiated by Kenny (4) and by Hanley 
(3) and, indirectly, by Rosen (10). Edwards’ 
rescarch was done with statements subsequently 
used in developing the PPS (2), a forced-choice 
inventory in which statements are paired on the 
basis of their social desirability scale values. 
Kenny’s research was done with a set of person- 
ality statements originally used in a study by 
Zimmer (12). The statements used by Hanley 
were selected from three of the MMPI scales. 
Rosen’s study was also done with statements from 
the MMPI. 

In all of the studies mentioned above, the 
subjects given the personality inventories were 
asked to sign their names on their answer sheets. 
That this requirement may be an important 
factor in the relationship between probability of 
endorsement and social desirability scale value is 
indicated by Meehl and Hathaway (8). After 
surveying research by Maller (7), Olson (9), and 
Spencer (11) on self-descriptions made under 
anonymous conditions, Meehl and Hathaway con- 
clude that requiring subjects to sign their names 
to their self-descriptions has been shown to have 
a definite influence on the descriptions obtained. 
The present study was undertaken, therefore, to 
determine whether the relationship between prob- 
ability of endorsement and social desirability scale 
value would be found when subjects made their 
assessments anonymously. 


METHOD 


The statements used in the study were those appear- 
ing in the 128-item Interpersonal Check List (ICL). 
The ICL was developed by staff members of the Kaiser 
Foundation Research Project in Psychology. It has 
been described in an article by LaForge and Suczek (5) 
and has been prepared in booklet form by Leary (6). 
The ICL can be used for purposes of both self-assess- 
ment and assessment by others. In self-assessments, the 
S checks each item he regards as self-descriptive. The 
ICL thus closely resembles the typical personality in- 
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ventory in which the S responds “Yes” or “No” to the 
individual items in the inventory. The ICL statements 
were selected for investigation because it seemed of 
value to extend our knowledge of the relationship be- 
tween probability of endorsement and social desirability 
scale value with respect to new and different statements 
which had not been used in any of the previous studies. 

The 128 items in the ICL were given to a group of 
58 male and 39 female college students who were asked 
to make comparative judgments of the social desirabil- 
ity of each of the items. Judgments were made in terms 
of a nine-interval scale, the first interval being defined 
as highly socially undesirable and the last as highly 
socially desirable, following the same instructions used 
in an earlier study (1). Using these judgments, a scale 
value for each item can be obtained by the method of 
equal-appearing intervals. 

Before actually calculating scale values of the items, 
the distributions of judgments for the males and females 
were examined to determine whether the judgments ob- 
tained from the two groups could be pooled. This was 
done by determining the interval in which thescale value 
of each statement would fall for each group of judges 
considered separately. The correlation between these 
scale intervals over the 128 statements was .90, a value 
sufficiently high to warrant pooling the two distribu- 
tions. On the basis of the judgments of the combined 
group of 94 to 97 judges, equal-appearing interval scale 
values were found for the 128 items. The scale value 
for a particular item corresponds to the median of the 
distribution of judgments for the item on the nine- 
interval scale. The number of judges varies for the 
items because of failure to obtain complete sets of 
judgments for all 58 male judges. 

The ICL was then given to students registered in 
speech classes at the University of Washington with 
standard instructions for obtaining self-descriptions, 
except that they were told zot to put their names on 
their test booklets. In addition, the students were told 
that their instructor was not interested in identifying 
anyone’s answers and that the data were to be used in 
a study of sex differences. It was hoped that these in- 
structions would offer some assurance to the students 
that their answers would indeed remain anonymous. 


RESULTS 


Complete records were obtained from 67 female 
and 66 male students. For each of these two groups 
the frequency of endorsement of each item was 
found and transformed to a proportion. These 
proportions were regarded as the probabilities of 
endorsement for the various items. The product- 
moment correlation between the female and male 
probabilities over the set of 128 items was .95. 
Because of the high relationship between the 
two sets of probabilities, the data were pooled and 
a new set of probabilities obtained for the com- 
bined group of 133 Ss. 

Probability of endorsement of an item was then 
plotted against the previously determined social 
desirability scale value of the item. The plot was 
linear and the product-moment correlation be- 
tween the two variables was .83. 


396 


Discussion AND CONCLUSIONS 


If anonymity has any influence on self-descrip- 
tions, it would seem that the influence should be 
such as to produce more frank evaluations. Under 
anonymous conditions, therefore, one might ex- 
pect responses to personality statements to be less 
related to the social desirability scale values of 
the statements than when Ss are asked to sign 
their names to their self-assessments. The corre- 
lation of .83 obtained between the two variables 
of interest in the present study is, however, not 
significantly different from the value of .87 re- 
ported previously for a different set of statements 
and a different sample of Ss and where the assess- 
ments were not made anonymously. 

The correlation reported, of course, does not 
bear upon the problem of whether or not we would 
have obtained a significantly higher correlation 
for the same set of statements if Ss had been 
asked to sign their names to their self-descriptions. 
More important would seem to be the fact that 
the correlation obtained in the present study is of 
sufficient magnitude to indicate that assurance of 
anonymity does not eliminate nor drastically 
change the nature of the relationship previously 
found between probability of endorsement and 
social desirability scale value. 
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SOCIAL SETTING AND CONFORMITY TO A LEGAL REQUIREMENT!2 
ABRAM M. BARCH, DON TRUMBO, anv JOHN NANGLE 


Michigan State University 


HE PURPOSE of the study was to note the 
| effect on a subject’s behavior of observing 
others conform or fail to conform to a legal re- 
quirement. The specific hypothesis under test was 
that the signalling behavior of the driver of a 
motor vehicle preparing to turn at an intersection 
would be related positively to the signalling be- 
havior of the driver of an immediately preceding 
motor vehicle. 

The research was stimulated by the work of 
Blake and his associates concerned with the in- 
fluence of social background or social setting on 
conformity behavior (1, 2, 3, 4), but the data are 
also relevant to other formulations of imitation 


1 The study was supported by the Highway Traffic 
Safety Center of Michigan State University, 

2 The authors are indebted to Dr. Charles Hanley 
for statistical advice and assistance. 


and social conformity. In the situation studied, 
there is a minimum i i 

subject and the model, and the subjects can be 
assumed to have had considerable opportunity to 


METHOD 
The Observati ons 


Preliminary observations 
Variation in turn-si 
istic situation 
the study. 


Four different intersections in the Greater 
Lansing area were observed for a total of 61 hours 


indicated that the 
ignalling present in the natural- 
was sufficient for the purpose of 
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over a four-week period in the summer of 1956. 
The particular hours during which the intersections 
were observed depended on considerations in- 
volved in a broader study of turn-signalling be- 
havior; all data were collected during daylight 
hours and in good weather (no fog or rain). Inter- 
sections studied included two instances of two 
two-lane roads meeting at a two-way stop, one 
four-lane highway intersecting a two-lane road at 
a stop signal, and one six-lane divided highway 
intersecting a three-lane road at a stop signal. 

During any period of observation, the observer 
noted all cars moving from a given direction on a 
given leg of the intersection and recorded the 
following information for all turning cars: (a) 
presence or absence of a signal to turn; (b) direc- 
tion of turn; (c) presence of another car 100 feet 
or less behind the turning car when it began its 
turn; (d) sex of the driver. (At two of the seven 
legs observed, only cars turning in a given direc- 
tion were recorded because of the low occurrence 
of turns in one direction and/or the difficulty of 
accurately noting turns in both directions.) 

The observer stationed himself near the inter- 
section where he could best make the observations 
required. In most cases he was diagonally across 
the intersection from the observed traflic move- 
ment. At other times he stood about 100 feet 
ahead of the intersection along the lane(s) being 
observed. Data collected under control conditions 
designed to eliminate the possibility that drivers 
might note the presence of observers indicated 
that the visible presence of observers did not affect 
the percentage of cars signalling for a turn. In a 
reliability check, agreement between two observers 
for the same cars was 99 per cent or better for all 
categories except “the presence of a following car” 
(94 per cent). 

During the periods of observation, 4,229 male 
drivers and 1,004 female drivers of passenger 
cars turned at the locations under study. 


Criteria for Scoring Conformity 

Michigan state law requires that intention to 
turn be signalled by either hand signal or electric 
signal but does not specify the distance this signal 
must be given prior to turning or make a distinc- 
tion between a signal for a left turn and one for a 
right turn. Therefore, a driver was designated as 
signalling if he blinked his left or right turn signal 
light or gave any hand signal, except a hand 
signal for stopping, regardless of the direction of 


turn. 
RESULTS 


The first step in the analysis consisted of ex- 
tracting the appropriate pairs from the field 
observation data. The criteria for selection were: 
(a) both cars turned at the intersection under 


TABLE 1 


REACTIONS OF SUBJECTS TO SIGNALLING (S) 
AND NONSIGNALLING (NS) MODELS 


Behavior of Subject 


Behavior High Conformity Legs Low Conformity Legs 


of Model 


Right Turn | Left Turn | Right Turn | Left Turn 


s |ns| s |NS| S | NS S | NS 


S n | 140 | 74 | 167 | 62 48 |100 | 77 | 55 
%\ 65) 35| 73 27 | 32 | 68 | 58 | 42 
NS n g4 | 64| 36 | 21 | 42 |103 | 64 58 
%| 57|43| 63 | 37 29 | 71 | 52) 48 


study; (b) the following car was 100 feet or less 
behind the lead car when the lead car began its 
turn; and (c) no cars were in the same lane be- 
tween the two turning cars. A given car could 
appear in only one pair of cars; i.e., it could not 
be a following car for one pair and the lead car 
for the next pair. The total number of such pairs 
was 1,195. Of these, 723 pairs had a conforming 
model and 472 pairs had a nonconforming model. 

The signalling behavior of the follower was 
significantly related at the .01 level to the sig- 
nalling behavior of the lead car (x? = 15.78, 1 df). 
This positive relationship, although highly sig- 
nificant, was found to be a weak one. The strength 
of the association as estimated by tetrachoric r 
was .19. 

A future analysis of the data was felt desirable 
since the general level of conformity (percentage 
of all turning cars signalling without regard to 
presence of a model car) varied with the direction 
of turn and the leg of the intersection. Women 
also tended to signal more frequently than men. 
However, the difference was small and the pattern 
of relationships within the data for women was 
quite similar to that for men. Therefore, men and 
women drivers were combined in all comparisons, 

Table 1 presents the number and percentage of 
subjects signalling or failing to signal in relation 
to the signalling behavior of the model under 
various conditions of direction of turn of the 
subject and the general conformity level character- 
istic of the intersection leg at which the turn was 
observed. General conformity level for an inter- 
section leg was taken as the percentage of turn- 
signalling for all cars turning in a given direction 
without regard to the presence of a model car. 
Intersection legs were classified as high conformity 
if the percentage of turning-signalling for left 
turns was between 62.5 and 88 and the percentage 
of signalling for right turns was between 37.5 and 
62.5. Low conformity legs had a signalling per- 
centage between 37.5 and 62.5 for left turns and 
one between 20 and 37.5 for right turns. 

Table 2 presents the results of the x? analyses 
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TABLE 2 
STATISTICAL ANALYSES OF THE INFLUENCE 
OF THE MODEL 


Respondent Categories xe p Ttet 


All cases 1: 
All left turns 3 5 -2€ 
All right turns 4.82 -05 -14 
All cases at high con- 6 

formity legs 


All cases at low conform- 1.17 = -09 
ity legs 

Left turns at high con- 1.67 = -18 
formity legs 

Right turns at high con- 2.43 = 14 


formity legs 


Left turns at low con- 0.66 = 09 
formity legs s 
Right turns at low con- 0.27 — 07 


formity legs 


a All analyses have 1 df. 


of the influence of the signalling behavior of the 
model on the subjects signalling for various 
combinations of conditions. Tetrachoric r is pro- 
vided for each combination to show the strength 
of the association. In all comparisons the relation- 
ship was positive to the hypothesis of influence. 
The strength of the relationship was low for all 
analyses, ranging from .07 to .20. 

An attempt was made to increase the possible 
identification of the subject with the model by 
considering only those pairs in which both mem- 
bers of the pair were of the same sex and turned 
in the same direction. There were 691 such pairs. 
The x? for this analysis, as would be expected, 
was significant at the .01 level (x? = 7.33, 1 df). 
However, the tetrachoric r was found to be Th 
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essentially the same as that obtained for the other 
comparisons. 

The results supported the hypothesis that the 
behavior exhibited by others in a situation in- 
volving a legal norm influences a subject’s be- 
havior. The weakness of the demonstrated in- 
fluence should be considered in light of the lack of 
formal relationships or of interaction, except in a 
minimal sense, between the model and the subject 
and the subject’s ample opportunity to observe the 
behavior of others in the same or similar situation 
prior to the test observation. 


SUMMARY 


Observations made under normal traffic condi- 
tions were analyzed to determine the influence on 
the signalling behavior of automobile drivers of 
the signalling behavior of other drivers. A weak 
but significant positive relationship was found be- 
tween observation of conformity of others toa 
legal requirement and self conformity, 
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ITTES and Kelley (2) recently reported an 
experiment on conformity to group norms 


which showed, among other things, that the sub- 
ject’s experimentally induced feeling of acceptance 
within the group had a rather different effect on 
his private and public expressions of conformity 


This paper may be identified a 
A-236 of the Bureau of Applied Soci 
lumbia University, i 


th of a st x 
ducted by James Coleman, Elihu Kat udy con 
writer. 


versity 


to the group’s previously established judgment. It 
was found, in brief, that individuals whose feeling 
of being accepted in the group was high felt the 
greatest freedom to express any disagreement with 
feli S judgment publicly. Individuals whose 
oa & of acceptance was low exhibited high public 

©nlormity but showed little conformity in their 
Private expressions. Individuals who were made to 
feel an “average” level of acceptance exhibited a 

high degree of genuine adherence to the norms... 
extending even to Conditions of privacy.” The 
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present note addresses itself exclusively to the 
similarity or difference between public and private 
conformity, and not to the determination of private 
conformity as such. 


DATA 


The conclusions of Dittes and Kelley in this 
respect can be applied in slightly modified form to 
data, originally secured for another purpose (see 
1, 4), on the use of new drugs by physicians. 
Eighty-five per cent of the general practitioners, 
internists, and pediatricians practicing in four 
selected cities were interviewed regarding their use 
of a certain widely used family of drugs, comprising 
several new variants which had made their ap- 
pearance successively in the course of recent years. 
At the same time, the records of prescriptions writ- 
ten by these same doctors were examined in the 
local pharmacies for sampling dates extending over 
a period of 15 months. One may think of the 
physician’s writing of prescriptions as his private 
behavior, and of his declarations in the course ofan 
interview as the analogue of “public” expressions 
in the sense of the Dittes and Kelley experiment. 
So defined, “public” and “private” data are avail- 
able on three items of behavior: (a) the date on 
which the doctor used “gammanym,” the newest 
variant of the family of drugs, for the first time; 
(b) which of the three chief available variants he 
favored at the time the interviews were conducted; 
(c) which of the two chief variants then available 
he favored 15 months earlier. 


RESULTS AND DISCUSSION 


Existence of Norms 
In the fluid situation created by the recurrent 
appearance of new drugs, there is no established 
norm on record as to which of two or three com- 
eting variants a physician should prefer, nor as to 
the “right” date on which to introduce a new drug 
into one’s practice. What the norm in these respects 
was at the time the interviews were conducted can, 
however, be inferred from the prevailing direction 
of the discrepancies between private performance 
and public accounts of behavior: (a) One-half of 
the physicians stated as the date on which they 
first used the newest drug a date earlier—often 
many months earlier—than that established by 
the prescription search (Table 1)2 (b) Two-thirds 


2? The extent of this discrepancy can hardly be ac- 
counted for by the sampling of days in the prescription 
search. (Three consecutive days per month were sam- 
pled.) It is true that the presumed starting date of 
gammanym use, obtained by recording prescriptions on 
only a sample of days, can err only by being too late, 
and not by being too early. It is, however, possible to 
estimate the size of that error, since not only prescrip- 
tions for the new drug were recorded, but also prescrip- 
tions for its predecessors. It was found that during the 


TABLE 1 
DATE OF INTRODUCTION OF A New DRUG: 
INTERVAL BETWEEN INTERVIEW STATEMENTS 
AND PRESCRIPTION RECORD 


Date According to Interview Percentage 
of Doctors 
(N = 70) 
Follows prescription date by 2 months 24% 
or more 
Falls within one month’s distance of pre- 27 
scription date 
Precedes prescription date by 2-7 months 25 
Precedes prescription date by more than 24 
7 months 
100% 


TABLE 2 
DRUG Variant FAVORED AT TIME or INTERVIEW 


ii " Variant Favored According to 
‘Varjant:Ravored Prescription Record 


According to 


Interview Statement Oldest Middle Newest 
Oldest 33% 8% 6% 
Middle 25 32 18 
Newest 42 60 76 

Total 100% 100% 100% 

N (24) (25) (61) 


of the physicians whose prescription record at the 
time of interviewing showed predominant use of 
the oldest drug stated that they preferred one of 


average two-month period the proportion of doctors 
who were missed by the prescription search—i.e., for 
whom prescriptions for neither the new nor the older 
drugs were recorded on the sampling days falling within 
that period—was 23.2% (with a standard deviation of 
06%). This may be taken as an estimate of the propor- 
tion of doctors who were missed by the prescription 
search during the two months starting with their re- 
spective introduction of the new drug. But since few 
doctors, if any, made a complete switch from older 
drugs to newer drugs, one must add to the estimated 
23.2% who were altogether missed by the prescription 
search during these two months the doctors who were 
covered by the prescription search, but happened to 
prescribe only older drugs on the sampling days in 
question. This is estimated at 4.6% of all the doctors 
in the sample. Thus, if all the doctors had stated their 
introduction date accurately during the interview, the 
proportion who would, due to the sampling of days, 
nevertheless appear to have up-dated themselves by 
two months or more may be estimated at 23.2% 
+ 4.6% or 27.8%. Actually, as Table 1 shows, 49% 
appear to have up-dated themselves to this extent. 
The above estimate of 4.6% is based on the follow- 
ing considerations. Of all the prescriptions written for 
this family of drugs by doctors during the sample days 
within two months after their respective first sampled 
prescription of gammanym, 67% were for older drugs. 
The proportion of their prescriptions which ordered the 
older drugs during the two months following their first 
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creme PB PTION 
4 RESCRI D 
PANCIES IN DATE OF FIRST 
ay A New DRUG, By SOCIOMETRIC STATUS 


Number of Designations 
4i Received as Advisor 
tion Date According to 
TE eecviow Statement Sons Gene 
None | Two | More 
Earlier than by prescription) 63% | 33% | 36% 
d . 
care as by prescription 26 33 21 
ord® Cy 
bates than by prescription} 11 33 43 
record ae a = 
100% | 100% | 100% 
a (35) | (21) | (14) 


® Allowing a one-month tolerance. 


the newer drugs. Sixty per cent of those whose 
current prescription record showed predominant 
use of the middle one of the three drugs stated that 
they preferred the newest (Table 2). (c) A substan- 
tial portion (29%) of the 52 doctors who, according 
to their prescription record, had prescribed pre- 
dominantly the oldest variant 15 months before 
the interviews took place, failed to make any 
mention of this drug when recounting the history 
of their use of drugs of this general kind. 

From these results it may be concluded that 
there prevailed, at the time of the interviewing, 
norms in these communities of physicians which 
placed high value on (a) having introduced the 
newest drug early; (0) currently favoring the 
newest drug over its predecessors; (c) having 
favored the middle drug over the oldest 15 months 
earlier (i.e., before the appearance of the newest 
drug). Thus, with Tespect to all three items of 
behavior, the Preponderant direction of the dis- 
crepancies between the prescription record and 
the interview statements is such that the interview 
statements make the doctor appear more “yp-to- 
date.” This finding Supports the inference that 
the prevailing norm in these medical communities 
at the time of interviewing was generally favorable 
to “modernity,” at least in the use of drugs. The 


actual prescription of gammanym was probably not far 
different. The average doctor wrote 7.8 prescriptions for 
this family of drugs on the sampling days falling within 
two months of his first sampled Prescription of gam- 
manym, and probably as many on the sampling days 
falling within two months of his first actual Prescription 
of gammanym. The probability of all 7.8 of these pre- 
scriptions having been for the older drugs is therefore 
estimated as the seventh power of .67, which is .06, 
Applying this factor to the estimated Proportion of 
doctors not altogether missed by the prescription search 
during the two months starting with their introduction 
of gammanym (100% — 23.2% = 76.8%) yields a 
figure of 4.6%. 
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TABLE + 

DISCREPANCIES IN DATE OF First PRESCRIPTION 

or A New DRUG, IN RELATION TO SOCIOMETRIC 
STATUS, WITH ACTUAL PERFORMANCE 

HELD CONSTANT 


Percentage of Doctors who 

Stated a Drug Introduction . “ 
Date Earlier (Later) Than Number of Designations reeceived 
the Median Date State: as Advisor 

Those who Act 


i One Three 

neously with Them None or Two oF More 

Earlier than median 60% 45% 36% 
Later than median 40 55 (3 

Total 100% 100% 100% 

N (35) (21) (14) 


findings to be reported below, however, are of 
interest even if the above three sets of discrepancies 
merely express three separate norms relating 


specifically to the use of the particular drugs in 
question. 


Conformity and A cceplance in the Group 


When the physician’s standing in his medical 
community is taken into account, it is found that 
the discrepancies in the prevailing direction occur 
most frequently among the low-status physicians, 
who tend to make themselyes appear more up-to- 
date than they really are. An individual’s standing 
in the community of his colleagues is measured by 
the number of designations accorded him in re- 
Sponse to the sociometric question: “When you 
need information or advice about questions of 
therapy, ... on whom are you most likely to call? 
It is chiefly those seldom or never named in answer 
to this question whose interview statements make 
them appear more up-to-date than their prescrip- 
tion record. 

Table 3 shows this relationship for the date on 
which the doctor used the newest of the drugs for 
the first time. The contrast is quite marked, but 
is partly due to a ceiling effect: high-status indi- 
viduals did in fact tend to be more up-to-date m 
the timing of their first use of this new drug, and 
hence had less opportunity to deviate, in their 
interview Statements, in the up-to-date direction. 
However, the phenomenon persists even when 
actual performance is held constant, as shown 1n 
Table 4. Doctors were first divided into two-month 
intervals according to their drug introduction date 
as established by the prescription search. Within 
each such two-month interval, the median inter- 
view statement as to the introduction date of the 
drug was then determined. Each doctor was then 
classified according to whether his own interview 
Statement gave a date later or earlier than the 
median statement of doctors who actually intro- 
duced the drug within the same two-month inter- 
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i TABLE 5 
DISCREPANCIES IN DRUG FAVORE. 


D AT THE 


TIME OF INTERVIEWING, IN RELATION 


TO SOCIOMETRIC STATUS 


Number of Designations Received 
as Advisor: 


None 


One or Two 


Three or 
More 


Percentage who 
stated they fa- 
vored a mewer 
variant than 

prescription rec- 
ord shows, 
among those 
who prescribed 
predominantly: 
the oldest 
variant 
the middle 
7 variant 
l Percentage who 
stated they fa- 
vored an older 
variant than 
prescription 
record shows, 
among those 
who prescribed 
predominantly: 
the middle 
variant 
the newest 
variant 


67% (12) 
64% (11) 


9% (11) 
21% (24) 


17% (9) 
63% (8) 


0% (8) 
29% (14) 


33% (3) 
50% (6) 


17% (6) 
23% (13) 


TABLE 6 
DISCREPANCIES IN DRUG FAVORED 


15 MONTHS 


Prior TO TIME OF INTERVIEWING, IN 
RELATION TO SOCIOMETRIC STATUS 


Number of Designations Received 


as Advisor 
None Oncor Two Three a 


Percentage who 

stated they fa- 
N vored the newer 
variant, among 
those who pre- 
scribed predom- 
inantly the 
oldest 


33% (24)| 30% (20)| 13% (8) 


show @ 


lished from the prescription record? 


val as he did. It is seen that even when possible 
ceiling effects are eliminated by holding the date 


of introduction according to the prescription 
record constant, those of low standing in the group 
quite disproportionate tendency to report 
an introduction date which precedes that estab- 


3 None of the main results (Tables 4, 5, and 6) are 
statistically significant in the usual sense, but general- 
izability of the descriptive results beyond the four cities 
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Table 5 records the results obtained with respect 
to the second item of behavior: variant favored at 
the time the interviews were conducted. The 
necessity to control for actual performance here 
makes for very small base figures, and the results 
are somewhat less consistent than in Table 4, but 
they are substantially of the same order. Those of 
high standing in the group consistently “up-date” 
their behavior less than do those of low standing, 
although those of intermediate standing do not 
always fall into the expected place. The two last 
rows of Table 5 show that this difference is specific -` 
to distortions in the direction of the norm; the 
relatively few distortions that occur in the opposite 
direction are, if anything, more prevalent among 
those of high standing. 

Table 6 records the results for the third item 
of behavior: variant favored, among the two then 
available, 15 months prior to the time of inter- 
viewing. Once again, those of high standing in the 
community of colleagues only seldom reporte! 
having used, 15 months before the interviews took 
place, a newer drug than predominated among 
their recorded prescriptions, while those of low 
standing frequently made such reports. 


Discussion 


Thus, with respect to all three items of behavior 
on which information is available, doctors of low 
standing in the community frequently reported, 
during an interview, more up-to-date behavior 


surveyed could not be claimed in any case. (For Table 
4, collapsed to 2 x 2 format, chi square {corrected for 
continuity] = 1.721; dj = 1, $ = 096 for a one-tailed 
test. [See 5, Table 7, P- 125; 6, pP- 108—110].) If the 
results were based on a sample randomly drawn from a 
defined and much larger population, observations on 
the whole population might very well fail to confirm 
them. In fact, however, the observations were made on 
85% of the target population (general practitioners, 
pediatricians, and internists practicing in four cities). 
There is therefore little doubt that the findings would 
be confirmed by observations on the entire target popu- 
lation. On the other hand, neither the sample nor the 
target population can be claimed to represent any 
larger population, regardless of the outcome of signifi- 


Tt is quite another matter that the results are, of 
course, intended as contributory evidence of the more 
general applicability of Dittes’ and Kelley’s conclu- 
sions. Establishment of such a proposition ultimately 
depends on confirmation in a variety of concrete situa- 
tions (with diverse populations, norms, indicators of 
acceptance, and expressions of conformity), rather than 
on the generalizability of one particular finding to a 
larger population. They are here shown to apply to 
three items of behavior in one particular situation 
rather different from the original experiment. The posi- 
tion taken here is essentially that expressed at some 


length in Lipset et al. (3, pp. 427-432). 
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than is indicated by the prescription record, while 
doctors of high standing seldom did so. Since, as 
was shown above, the general norm favored being 
“up-to-date” in the three matters under investiga- 
tion (and perhaps in other maters as well), these 
results seem to be close parallels to the corre- 
sponding findings of the Dittes and Kelley 
experiment. 

One may note that the high-status doctors 
not infrequently deviated in their “public” (i.e. 
interview) statements from their prescription 
record in the “wrong” direction, that is, so as to 
appear less up-to-date than they actually were. It 
is not clear from Dittes’ and Kelley’s report 
whether individuals who agreed with the group 
privately ever expressed disagreement in public. 
If not, the contrast may be due to the fact that in 
the present study the norm of the group was not 
“crystallized and registered,” while the doctors’ 
statements were subject to a certain amount of 
unmotivated error due to faulty recall, which 
played no part in Dittes’ and Kelley’s experiment, 
It is also possible that there existed, uniquely 
among the high-status individuals, a competing 
norm in favor of conservatism; but this seems an 
unlikely explanation in view of the simultaneous 
finding that high-status individuals tended to be 
more up-to-date in fact than low-status individ- 
uals, at least in the date of their introduction of 
the newest of the drugs. (Why high-status indi- 
viduals were more up-to-date in their actual 
behavior will be discussed elsewhere. See also {1].) 


SUMMARY 


An experiment by Dittes and Kelley (2) had 
shown that individuals who felt accepted in their 
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group felt relatively free to express any disagree- 
ment with the group’s judgment publicly, while 
those whose feeling of acceptance was low ex- 
hibited much higher conformity in their public 
than in their private expressions. Data from a 
Survey among physicians provide an analogue in 
the form of interview statements about prescrip- 
tions of new drugs (public expressions) and the 
doctor’s actual prescription record (private be- 
havior). A norm of being up-to-date with respect 
to three choice situations is inferred from the 
Prevailing direction of discrepancies between 
private performance and public accounts, Accept- 
ance in the group was measured sociometrically. 
In all three choice situations, those of low accept- 
ance reported more up-to-date behavior than 
indicated by the Prescription record, while doctors 
of high acceptance seldom did so. 
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ONSISTENT with the deductions of Lewin (5), 
C Hovland and Sears (4) found that avoidance- 
avoidance conflicts produced more blocking and 
compromise responses in the manual motor situa- 
tions they studied than did approach-approach 
ones. Barker (1), however, found no consistent 
evidence that choice time and “vicarious trial and 
error behavior” were greater in children choosing 
between paired repellent liquids than between 


1 The author wishes to acknowledge the generous 
assistance of Dr. John M. Digman, 


attractive ones. In a second experiment (2), he 
Presented college students with Paired hypothetical 
alternatives involving personal characteristics and 
environmental conditions and found that the 
frequency of choices marked “uncertain” was 
8reater for negative than Positive alternatives. 
The present investigation attempted to devise 
comparable approach-approach and avoidance- 
avoidance situations that would involve the sub- 


jects emotionally and Permit clear-cut measures 
of conflict behavior. 
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APPROACH-APPROACH AND AVOIDANCE-AVOIDANCE CONFLICTS 


METHOD 


Forty male and 40 female students from the intro- 
ductory psychology course served as subjects (Ss). 
Conflicts were created by systematically pairing with 
one another seven personal characteristics: adjustment, 
attractiveness, health, intelligence, popularity, talent, 
and wealth. Two types of pairings were employed, 
one in which the S was required to designate the 
alternative that he would rather have in greater degree 
than he had at present (approach-approach conflict), 
and a second in which he was required to designate the 
alternative that he would rather have in lesser degree 
than he had at present (avoidance-avoidance conflict). 
In the 42 conflicts presented, 21 described approach- 
approach situations and 21 described avoidance- 
avoidance ones. Each choice was typed on a separate 
3 x 5 card with the question “Which would you rather 
be?” centered above, and the alternatives boxed below 
on left and right portions of the card. An example of 
an approach-approach conflict follows (for the avoid- 
ance-avoidance analogue “less” was substituted for 
“more”’): 


Which would you rather be? 
More attractive than More intelligent than 
you are now. you are now. 


In the first part of the experiment, the experimenter 
(E) arranged the 42 conflict cards in random order and 
placed them face down in a single stack before the S 
who was given written instructions as follow: 

Each card describes a conflict which you must 
resolve. When the experimenter gives the signal, 
turn up the topmost card and study the conflict 
presented. If your choice is the alternative on the 
left, place the card, face down, on the table to the 
left of the pile. If your choice is the alternative on 
the right, place the card, face down, on the table to 
the right of the pile. n 

Pay no attention to the timing. Take as much 
or as little time as you like with each card. 

Imagine that each conflict really confronts you. 
Be sure that your choice is the one that you would 
really make if you really had to decide. 

As soon as the S turned up the conflict card (and before 
he actually began to read the conflict) the Æ began the 
timin? Timing was stopped as soon as the card was 
face down and flat on the table. 

The conflict cards were then placed once more in 
random order, and the S$, without timing, was given 
written instructions as follow: 

Divide the cards into two piles of 21 cards each. 
In one pile place the cards describing the conflicts 
which you found relatively easier to resolve. In the 
other pile place the cards describing the conflicts 

hich you found relatively harder to resolve. You 


w V 
s in any fashion you wish. 


may proceed to do thi 


RESULTS 


Two measures of conflict behavior were 
available: the amount of time taken to resolve the 
two types of conflict, and the number of conflicts 
of each type judged to be easier to resolve. 

The mean resolution times taken for each type 
of conflict by the male and female groups are con- 
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TABLE 1 
MEAN RESOLUTION Tres (IN SEC.) FOR 
AppROACH-APPROACH AND AVOIDANCE- 
AVOIDANCE CONFLICTS OF MALE 
AND FEMALE SUBJECTS 


Conflict 


Sex Avoidance- 


Approach- 
Avoidance 


Approach 


Male 164.32 
Female S. 142.00 
Both 122.00 153.16 E 


TABLE 2 
ANALYSIS OF VARIANCE OF RESOLUTION 
TIME FOR ÅVOIDANCE-AVOIDANCE AND 
ÅPPROACH-APPROACH CONFLICTS 


Mean 


df square F 


Source of variation 


110.28* 
3.14 
2.04 


1 38844.06 
1 13086.30 
1 718.26 
8 4172.52 
8 352.22 


Between conflicts 
Between sexes 

Conflict X sex 

Between Ss within groups 7 
Conflict X Ss within groups 7: 
Total 159 


* Significant at the .01 level. 


TABLE 3 
MEAN NUMBER or Approacu-APPROACH AND 
AvoipANCE-AVOIDANCE CONFLICTS 
Jupcep Easier TO RESOLVE 


Conflict 
Sex Approach- Avoidance- 
Approach Avoidance 
Male 12.72 8.28 
Female 12,55 8.45 
Both 12.64 8.36 


tained in Table 1. The results of an analysis of 
variance, presented in Table 2, indicate that the 
difference between the time taken to resolve the 
approach-approach and avoidance-avoidance ‘con- 
flicts was significant at the .01 level of confidence. 
The interaction between sex and the experimental 
condition was not significant. The data were fur- 
ther analyzed for possible differences in variability 
between sexes; at the .05 level of confidence, the 
hypothesis of equal variability could not be 
rejected (F = 1.03 for approach-approach scores 
and 1.48 for avoidance-avoidance scores, df = 39 
and 39). 

The mean numbers of each type of conflict 
judged easier to resolve by the male and female 
groups are contained in Table 3. An analysis of the 
difference in variability between sexes showed that 
at the .01 level of confidence the males were more 
variable than the females (F = 1.98, df = 39 and 
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39). A ¢ test of the type suggested by Gronow (3) 
for use with groups differing in variability indi- 
cated that at the .05 level of confidence the null 
hypothesis in respect to sex differences could not 
be rejected (£ = .20, df = 39 and 39). Assuming the 
null hypothesis tenable, the two sex groups were 
then combined to test the hypothesis that the mean 
number of approach-approach conflicts judged 
easier to resolve minus the mean number of avoid- 
ance-avoidance conflicts similarly judged was 
zero; the results indicated that the null hypothesis 
could be refuted considerably beyond the .01 
level of confidence (¢ = 5.74, df = 79). 


CONCLUSION 


The results of the present experiment are in 
agreement with Lewin’s deduction that approach- 
approach conflicts are more easily resolved than 
are avoidance-avoidance conflicts, Approach- 
approach conflicts required significantly less time 
to resolve than did avoidance-avoidance conflicts, 
and significantly more approach-approach con- 
flicts were judged easier to resolve than were 
avoidance-avoidance conflicts. Males and females 
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did not appear to differ in the conflict behavior 
except that in judging the difficulty of conflicts, 
males proved to be significantly more variable 
than females. 


REFERENCES 


1. Barker, R. G. An experimental study of the reso- 
lution of conflict by children, In Q. McNemar & 
Maude A. Merrill (Eds.), Studies in personality. 
New York: McGraw-Hill, 1942. Pp. 13-34, 

2. Barker, R. G. An experimental study of the rela- 
tionship between certainty of choice and the 
relative valence of the alternatives. J. Pers., 1946, 
15, 41-52. 

3. Gronow, D. G. C. Test for the significance of the 
differences between means in two normal popula- 
tions having unequal variances, Biometrika, 1951, 
38, 252-256. 

4. Hovranp, C. I, & Sears, R. R. Experiments on 
motor conflict. I. Types of conflict and their 
modes of resolution. J. exp. Psychol., 1938, 23, 
477-493. 

5. Lewin, K. A dynamic theory of personality, New 
York: McGraw-Hill, 1935, 


Received October 15, 1956, 


ERIMENTAL CHANGE OF 
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HE QUESTION of the consistency of adult 
| personality is a continuing problem in the 
field of personality theory. Kelly, in his longi- 
tudinal study of marital adjustment, found evi- 
dence of a remarkable degree of consistency as 
well as some evidence of significant changes over 
a 20-year period (2). One of the questions that 
remains is the extent to which personality change, 
when it occurs, can be understood in terms of the 
effect of specific environmental conditions. The 
present note is intended to report results bearing 
on this question from an experimental study ina 
large clerical organization. Other aspects of this 
study have been reported earlier (4, 5, 7). 

Two experimental groups were created that 
differed in their distribution of control. In one, 
called the autonomous Program, the amount of 
control exercised by lower-level employees was 
increased and that of the upper hierarchical levels 
decreased. In the other, called the hierarchical 
program, the amount of control exercised by the 
rank and file was decreased and that of the upper 
levels increased. The program for each group of 
approximately 200 female employees continued for 
over a year. A paper-and-pencil questionnaire 


designed Specifically to measure 26 personality 
trends to which each of the Programs had some 
degree of relevance was administered near the 
beginning of the experimental period, and about a 
year later at the end. Results reported earlier (7) 
indicated that subjects’ reactions to the work 
programs can be understood partly in’ terms of the 
relationships between their Personality structures 
and the nature of the Program in which they were 
located. It is of further interest, however, to 
investigate personality as a dependent variable; 
the extent to which subjects’ personality trends 
are found to change in predictable ways as a result 
of their experiences in the experimental programs. 

It was initially assumed that personality trends 
are quite stable and would be unlikely to change 
as a result of experimental treatment Neverthe- 
less, the two Programs were seen to have profound 
effects on the subjects’ attitudes toward the com- 
pany and satisfactions in it. In addition, the dura- 
tion of the experiment, as well as the seriousness 
with which the subjects reacted to it, raised the 
question of its possible effects on the relevan 


1 For the conception of personality used, see Allport, 
F. H. (1). 
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TABLE 1 


BEFORE-AFTER CORRELATIONS 


AND INCIDENCE OF CHANGE 


(Change in the predicted direction significant at the .05 level is indicated by +; change in a direction opposite 
to that predicted by -) 


Autono- Hierar- 


Before- mous 
after r Program Program 
N= N= N= 
195)" 115)" 80)" 
Autonomous Trends 
1. assume responsibility (anticipating, preparing for, and accepting the con- 59 n.s. n.s. 
sequences of one’s activities) 
2. be creative (develop new patterns; to use one’s imagination; to think up new 56 n.s. n.s. 
and original ways of doing things) 
3. increase the variety (number of types) of satisfactions in any given situation (to 4 n.s. n.s. 
enjoy as many different things as possible in the same situation, e.g., to “mix 
business with pleasure”) 
4. understand and explain the reasons for one’s behavior and opinions (to under- AS n.s. n.s. 
stand and explain one’s feelings) 
5. help others 59 ns. n.s. 
6. understand others’ viewpoints and feelings (to know why people feel or act as -60 n.s. S 
they do; to understand the reasons for others’ behavior) 
7. be independent of control figures (to be free of direction and control of persons „54 n.s. 0 
in authority) 
8. express one’s considered judgments (to think through things carefully and state 78 nS. ca 
one’s opinions) 
9, base one’s actions on one’s own critical judgments and evaluations (to carefully 38 a= + 
evaluate and decide things for oneself) 
10. act on the same level as others (to treat and be treated by others as an equal) .52 n.s: nS. 
11. understand relations between and reasons for things (to understand the basis for 12 Ca ar 
40 n.s. T 


things; to figure things out; to know “why”) 
g5; 


12. do one’s own thinking (not be suggestible; to decide things for oneself; to draw 


one’s own conclusions) 


13. take the initiative with others (to start things; to organize things) .42 n.s. = 
Hierarchical Trends s 

14. avoid emotional involvements (to avoid the expression and reception of emo- „65 n.S. n.s. 
tional acceptance; to, keep emotionally distant from others; to avoid being 
“warm” in relationships with others; to avoid affection) 

15. conform to the wishes of control figures (to please persons in authority) -60 n.s. n.s. 

16. adhere rigidly to specific rules or directions (to follow rules to the letter; to be .58 n.s. n.s. 
given directions for doing things) 

17. act inferior to control figures (to be humble in relation to persons in authority) -60 + n.s. 

18. avoid committing oneself (to keep one’s opinions to oneself; to keep from getting .40 n.s. n.s. 
oneself “out on a limb”; to keep people from knowing where one stands on con- 
troversial questions) y 4 , ; à ch 

19. keep one’s roles (behavior categories, relationships) at any giyen time distinct 40 n.s. + 
from other roles (to keep various parts of one’s life separated) 

20. admire or respect ‘control figures or symbols (to look up to persons in, or symbols 68 n.s. n.s. 
of, authority. 

21. be superior to others (to be better than others; to be more outstanding 69 + - 
than others) . 

22. bereflicient (not waste time or effort) 58 4 pe 

23. be submissive to control figures (to submit to dominant leaders; to obey strong .45 n.s. + 
and forceful authority figures) 

24. show self-discipline (to show self control; to keep one’s behavior disciplined) .42 + = 

25. depend on others (to receive direction from others) 48 ns. nis: 

58 ns. n.s. 


26. obey rules and follow directions 


a In some cases the N’s ar 


personality variables. A number of hypotheses 
regarding change in personality trends were 
therefore formulated on the basis of the assumption 
that environmental conditions may increase or 
decrease the characteristic energic (homeostatic) 
Jevel of trends on the basis of a simple reward-and- 
punishment principle: characteristics that are 
given opportunities for expression tend to increase 


e slightly less than this value due to a very small proportion of NA’s—never more than 4%. 


in potency; trends that are given only minimal 
opportunities for expression tend to decrease. 

The 26 trends measured by the personality 
questionnaire were divided into two groups of 
thirteen; “autonomous trends,” which were likely 
to be given opportunities for expression in the 
autonomous program (and not in the hierarchical), 
and “hierarchical trends,” which were likely to be 
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given opportunities for expression in the hier- 
archical program (and not in the autonomous). It 
was hypothesized that (a) in the autonomous 
program, autonomous trends should tend to 
increase in potency and hierarchical trends tend 
to decrease; (b) in the hierarchical program, 
hierarchical trends should tend to increase in 
potency and autonomous trends tend to decrease. 
Care was taken in framing the questions used 
to measure the personality trends to make them 
as general as possible and distinct from the com- 
pany situation. Responses to these questions are 
therefore not likely to reflect the subjects’ changing 
attitudes toward the experimental programs or 
toward the company. Five questions were em- 
ployed to measure each trend. For example, one 
of the questions for the trend, “trying to be inde- 
pendent of control figures,” was “How much do 
you believe that people generally require someone 
to tell them what to do?” For the trend, “trying 
to understand others’ viewpoints and feelings,” we 
asked, “How much does it bother you when you do 
not understand the viewpoints and feelings of 
people you associate with?” For the trend, “trying 
to keep one’s role at any given time distinct from 
other roles,” we asked, “Generally, how good an 
idea do you think it is to choose very close friends 
from among persons you work with?” For the 
trend, “trying to do one’s own thinking,” we asked 
“How hard do you find it to disagree with others 
even in your own thinking?” For the trend, “trying 
to act inferior to control figures,” we asked, “Do 
you think that people who have reached high office 
in government have a right to feel that they are 
better than most people?” For the trend, “trying 
to be submissive to control figures,” we asked, 
“Do you think that for a truly happy marriage 
that in major issues the husband should tell the 
wife what to do?” 


RESULTS 


Table 1 shows the before-after correlations (test- 
retest reliabilities) for each of the trends, and the 
direction of changes significant at the .05 level or 
less. A plus sign indicates that the change is in 
the direction predicted; a minus sign, a change in 
the opposite direction. Although all of the predic- 
tions are directional, two-tailed rather than one- 
tailed tests are employed to permit ascribing levels 
of confidence to changes contrary to hypothesis.’ 

Twelve changes in the predicted direction 
prove significant at the .05 level of confidence. Six 
changes, significant at the .05 level occur ina 
direction opposite to that predicted.’ The combined 


? The following statistical formula, designed specifi- 
cally to test the significance of change, was employed: 
t= (A — D — 1/ VA F D) (3, p. 207). 

3 The pattern of results remains essentially the same 
when evaluated in terms of the .01 level of confidence. 


results cannot be assessed Statistically since the 
trend measures are not independent, but it seems 
reasonable to conclude that some change has 
occurred and that it is predominantly in the pre- 
dicted direction. 

The preponderance of changes in the direction 
of decreasing trend potency is hard to explain. 
Most of the “correct” changes involv: 
of autonomous trends in the hierarchical program 
or the decrease of hierarchical trends in the auton- 
omous program. Likewise, five of the six “incor- 
rect” changes involve the decrease of autonomous 
trends in the autonomous program or hierarchical 
trends in the hierarchical program, No single 
trend is found to change significantly in the pre- 
dicted direction in both programs. 

Although somewhat ambiguous, the data seem 
to indicate that measurable change can be effected 
by a persisting change in environmental conditions. 
Furthermore, the change seems partly explicable 
in terms of the movement of personality toward 
equilibrium with its environment. A type of 
organism-environment reciprocity is suggested in 
which the organism selects, reacts, and adjusts to 
its environment, so as to optimize personality 
“closure” while at the same time modifying itself 
within limits so as to further increase trend expres- 
sion. This integral character of inner needs and 
outer conditions, as a general psychological 
principle is consistent with the Position taken by 
Murphy: “A personality is a structured organism- 
environment field, each aspect of which stands in 


dynamic relation to cach other aspect. There is 


organization within the organism and organization 
within the environment, but it is the cross-organ- 
ization of the two that is investigated in personality 
research” (6, p. 8), 


e the decrease 
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In this case, eight changes prove “correct” and fou 
“incorrect.” 


mi. 


